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ABSTRACT 

Schlaifer's  text,  though  considered  one  of  the  "best 
for  students  of  Business  Statistics,  nevertheless  is 
difficult  for  many  students  to  understand.   This  is  part- 
icularly true  of  those  who  are  returning-  to  college  studies 
after  a  considerable  lay-off.   The  authors  of  this  Research 
Paper  felt  that  future  students  might  "be  aided  "by  technical 
notes  in  which  the  material  is  presented  in  a  somewhat 
doT'in-to-earth  manner  that  would  be  easier  to  understand 0 
The  Technical  Notes  supplement  Schlaifer's  text  and  cover 
the  same  concepts;  that  is,  decision  under  uncertainty; 
the  use  of  probabilities  based  directly  on  experience ; 
binomial  sampling,  the  normal  distribution  and  sampling 
of  measured  values a 


INTRODUCTIO] 

The  purpose  of  these  Technical  Notes  is  to  supplement 
the  basic  theory  and  explanations  of  the  basic  text,  In- 
troduction to  Statistics  for  Business  Decisions,  Robert 
Schlaifer,  McGraw-Hill  Book  Company,  1961,  York,  Pennsyl- 
vania. 

These  Technical  Notes  are  primarily  for 

(1)  The  average  student  who  may  have 
never  been  exposed  to  statistical 
theory, 

(2)  The  student  ~ho   has  been  away  from. 
school  for  a  number  of  years, 

(3)  The  student  TTho  finds  Schlaifer  a 
bit  too  rapid  for  good  digest  ion 0 

The  titles  to  the  chapters  are  congruent  with  the 
titles  given  in  the  basic  text.   An  attempt  has  been  made 
to  introduce  problems  to  further  illustrate  the  basic 
theory  presented,  by  Professor  Schlaifer  in  order  that  the 
student  may  gain  greater  facility  in  problem  setups,  and 
gain  a  deeper  sense  of  familiarity  and  feeling  foi  statis- 
tics as  it  applies  to  the  everyday  application  both  in  the 
Navy  and  outside  of  it. 

Professors  Freund  ?nd  Williams  describe  the  relation- 
ship of  business  statistics  in  modern  business: 

.  ,  „  most  elementary  textbooks  stressed  th<   Dllect- 
ion  and  sources  of  data  and  the  presentation  of 
numerical  data  in  tables,  charts,  pictures,  and  maps e 


Although  the  importance  of  statistics  in  this 
phase  of  business  cannot  be  denied,  there  has 
been  a  pronounced  change  in  attitude  to  <: \   the 
subject:  business  statistics  is  now  viewed  as 
providing  quantitative  bases  for  arriving  at 
well-informed  decisions  with  respect  to  all 
matters  connected  with  the  operation  of   business 
In  short ,  it_  is  now  recognized  that ,  in  itjs 
broadest  sense,  business  statistics  includes 
also  the  methods  and  inferences  needed  to  pro- 


vide  an  adequate  flow  of  gjialitv^  raw  mater ials_ , 
JL9_  select  and  evaluate  the  pej^forniance  of  both 
machines  and  personnel,  to  design  products  and 
maintain  their  quality,  and  t_o  evaluate  new 
methods  of  production,  advertising,  and  selling 
$^)JL   goods  and  services  of  Indus  try  y  both  in  the 
immediate  present  and  in  the  near  and  distant 
future. L 

There  are  several  types  of  statistics,  such  as, 

descriptive  statistics,  inductive  statistics,  vital 

statistics,  and  quantum  statistics .   This  Technical  Notes 

are  primarily  concerned  TTith  inductive  statistics,   Descrip- 

tive  statistics  is  understood  to  mean  the  treatment  of 

any  numerical  data  which  does  not  involve  generalizations 0 

When  predictions,  estimations,  or  generalizations  are  made, 

we  are  speaking  of  inductive  statistics,   For  example  ,  take 

five  light  bulbs  of  brand  A  and  five  light  bulbs  of  brand 

B  and  burn  them  continuously  until  they  burn  out.   We  can 

easily  determine  the  average  lifetime  of  the  five  bulbs  of 

each  brand.   Suppose  brand  A  produced  the  folio-Ting  re- 


1 

Professor  John  E«  Freund,  and  Professor-  Frank  J, 

Williams.   Modern  Business  Statistics c  Englewood  Cliffs, 
New  Jersey:  Prentice-Hall,  Incc,  1958*  P-  20 
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INTRODUCTION 

The  purpose  of  this  syllabus  is  to  supplement  the 
basic  theory  and  explanations  of  the  basic  text,  Introduct*- 
ion  to  Statistics  for  Business  Decisions,  Robert  Schlaifer, 
McGraw-Hill  Book- Company,  196l,  York,  Pennsylvania. 

This  syllabus  is  primarily  for 

(1)  The  average  student  who  may  have 
never  been  exposed  to  statistical 
theory. 

(2)  The  student  who  has  been  away  from 
school  for  a  number  of  years. 

(3)  The  student  who  finds  Schlaifer  a 
bit  too  rapid  for  good  digestion. 

The  titles  to  the  chapters  in  this  syllabus  are 
congruent  with  the  titles  given  in  the  basic  text.  An 
attempt  has  been  made  to  introduce  problems  to  further 
illustrate  the  basic  theory  presented  by  Professor  Schlaifer 
in  order  that  the  student  may  gain  greater  facility  in 
problem  setups,  and  gain  a  deeper  sense  of  familiarity 
and  feeling  for  statistics  as  it  applies  to  the  every- 
day application  both  in  the  Navy  and  outside  of  it. 

Professors  Freund  and  Williams  describe  the  re- 
lationship of  business  statistics  in  modern  business: 

.  .  .  most  elementary  textbooks  stressed  the  collect- 
ion and  sources  of  data  and  the  presentation  of 
numerical  data  in  tables,  charts,  pictures,  and  maps. 
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Although  the  importance  of  statistics  in  this 
phase  of  business  connot  be  denied,  there  has 
fceen  a  pronounced  change  in  attitude  toward  the 
subject:  business  statistics  is  now  viewed  as 
providing  quantitative  bases  for  arriving  at 
well-informed  decisions  with  respect  to  all 
matters  connected  with  the  operation  of  a  business. 
In  short,  it  is  now  recognized  that,  in  its 
broadest  sense,  business  statistics  includes 
also  the  methods  and  inferences  needed  to  pro- 
vide an  adequate  flow  of  quality  raw  materials, 
to  select  and  evaluate  the  performance  of  both 
machines  and  personnel,  to  design  -products  and 
maintain  their  quality,  and  to  evaluate  new 
methods  of  production,  advertising,  and  selling 
the  goods  and  services  of  industry,  both  in  the 
Immediate  present  and  in  the  near  and  distant 
future . 1 

There  are  several  types  of  statistics,  such  as, 
descriptive  statistics,  inductive  statistics,  vital 
statistics,  and  quantum  statistics.   This  syllabus  is 
primarily  concerned  with  inductive  statistics.   Descrip- 
tive statistics  is  understood  to  mean  the  treatment  of 
any  numerical  data  which  does  not  involve  generalizations. 
When  predictions,  estimations,  or  generalizations  are  made, 
we  are  speaking  of  inductive  statistics.  For  example,  take 
five  light  bulbs  of  brand  A  and  five  light  bulbs  of  brand 
B  and  burn  them  continuously  until  they  burn  out.   We  can 
easily  determine  the  average  lifetime  of  the  five  bulbs  of 
each  brand.   Suppose  brand  A  produced  the  following  re- 


Professor  John  E.  Freund,  and  Professor  Frank  J.  Williams, 
Modern  Business  Statistics.   (Englewood  Cliffs:  Prentice- 
Hall,  Inc.,  1958)   p.  2. 
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suits : 


98?  +  863  +  102^  1-  972  +  7^6  =  918  averag^  hours/bulb 


and  Brand  B: 


892  ±  io7i  +  99?  t  929  ±  78?  =  ^  average  hours/bulb. 


The  computations  thus  far  are  in  the  realm  of  descrip- 
tive statistics.   If  it  is  concluded  from  this  data  that 
brand  B  is  superior  to  brand  A,  then  a  generalization  has 
been  made.   This  is  characteristic  of  inductive  statistics. 
It  is  readily  apparent  from  a  cursory  inspection  of  the 
data,  that  the  generalization  might  or  might  not  hold 
true.   If  we  ;\rere  dealing  with  6  light  bulbs,  the  data 
from  the  sixth  light  bulb  of  each  brand  could  reverse  the 
averages,  i.e.,  the  average  lifetime  of  brand  A  is  now 
greater  than  the  average  lifetime  of  brand  B.   In  either  case, 
risk  is  involved  of  making  the  wrong  generalization.   The 
analysis,  evaluation  and  control  of  chances  associated  with 

this  type  of  risk  is  one  of  the  main  functions  of  inductive 

2 

statistics. 


2Ibid.pp.  3-^ 


CHAPTER  I 
1.   THE  MEANING  OF  PROBABILITY 
1.1  Description 

Not  long  ago,  the  owner -manager  of  an  automobile 
dealership  said  to  me,  "My  business  has  prospered  because 
of  luck.  Every  move  I  have  made  has  been  a  gamble,  some 
have  been  won  and  some  have  been  lost  but  if  one  risks 
nothing,  he'll  not  have  a  business  very  long."  This 
businessman  is  a  dealer  in  automobiles.  Further  discuss- 
ion with  him  concerning  his  decisions  for  ordering  cars 
by  color,  model,  interiors,  parts,  etc.,  revealed  that 
almost  all  decisions  along  these  lines  were  made  intuitively 
either  by  himself  or  his  sales  manager.  He  said  that  he 
is  continually  faced  with  the  problems  like:  How  much 
advertising  is  the  optimum  amount?   (Four  hundred  dollars 
for  a  two  minute  plug  on  a  TV  station  is  approximately 
equal  to  the  profit  in  one  car  sale)  Kow  many  skits  should 
he  buy?  At  what  time  of  day  are  his  potential  customers 
most  likely  to  be  watching  TV,  etc.?  How  much  gas  should 
be  put  into  the  cars  sold  to  customers  for  good  will? 
These  are  a  few  examples  in  this  business  where  some  kind 
of  a  probability  model  might  be  an  excellent  aid  to  the 
solution  of  some  of  these  problems  that  are  solved  intuitive- 
ly now. 

Everyone  faces  the  future  with  uncertainty  in  private, 
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public  or  business  life,  in  every   /ofession,  and  in 
every  government  of  the  world.  For  example,  we  are  con- 
stantly  aware  of  the  uncertainties  that  can  occur  during 
a  naval  operation.   How  many  destroyers  are  needed  to 
successfully  complete  a  mission?  A  destroyer  cannot 
steam  without  feedwater;  what  are  the  chances  of  her 
evaporators  breaking  down?  How  great  a  risk  is  acceptable 
of  not  completing  a  mission?  How  reliable  are  the  Polaris 
Missiles  and  how  can  we  estimate  their  reliability?  How 
good  is  our  estimate?  What  are  the  risks  as  determined 
by  tests  and  what  risks  are  acceptable?  These  are  a  few 
of  the  problem  areas  that  can  bo  examined  in  the  light  of 
statistical  theory.   There  are  literally  thousands  of 
applications  of  it.  To  introduce  elementary  probability, 
the  die  model  offers  an  excellent  framework  for  thinking 
about  it. 
1.2  The  Die  Model1 

Can  you  think  of  a  die  with  other  than  six  sides? 
One  with  two  sides  may  be  constructed  by  the  intersection 
of  two  parallel  planes  with  a  sphere.  A  three-sided  die 
may  be  constructed  by  the  intersection  of  a  sphere  and 
three  skew  planes,  etc.   With  some  Imagination  one  can 
expand  this  thought  process  visualizing  a  four-sided,  a 


1 
Paul  G.  Hoel,  Elementary  Statistics  (John  Wiley  & 
Sons,  Inc.,  I960),  pp.  36-37. 
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five-sided,  the  normal  six-sided  die,  a  terwsided,  twenty- 
sided,  or  "infinitely-sided"  die  (sphere).  What  pro- 
babilities would  you  assign  to  each  face  of  each  kind  of 
die  coming  up  in  a  roll?  You  would  most  likely  assign  to 
the  two-sided  die,  §,  to  the  three-sided  3,/3>  to  the  four- 
sided  1/4- ,  etc.,  and  to  the  infinitely-sided  die  l/oo  or 
zero.   This  concept  is  easy  to  see  if  you  visualize  rolling 
a  giant  sphere  on  a  large  flat  surface.   What  probability 
would  you  assign  to  its  coming  to  rest  on  any  particular 
point  on  its  surface?  Assuming  that  it  would  rest  on  one 
molecule,  there  are  almost  an  infinite  number  of  molecules 
on  which  it  could  rest.  The  probability  that  it  would 
rest  on  any  particular  one  would  be  almost  l/oo  or  zero. 

Take  a  die  with  six  faces,  intuitively  a  probability 
of  1/6  would  be  assigned  to  any  number  1  to  6  coming  up 
on  the  roll.   This  assessment  could  be  incorrect  if  for 
example,  the  die  had  a  five  on  every  face,  or  it  was  load- 
ed, filed  on  one  side  or  corner,  or  shaved.   A  dishonest 
die  would  be  assigned  different  values  to  the  probability 
of  each  face  coming  up  on  a  roll. 

Assuming  that  all  the  possible  experimental  outcomes 
can  be  expected  to  occur  with  the  same  frequency,  then  the 
probability  of  occurence  of  each  event  can  be  defined 
as  follows: 
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The  probability  that  an  event  A  will  occur  is  the 

ratio  of  the  number  of  outcomes  that  correspond  to  the 

2 
occurrence  of  A  to  the  total  number  of  possible  outcomes. 

This  definition  fits  the  case  of  the  honest  die.  As 

an  illustration,  take  a  pair  of  honest  dice.   What  possible 

combinations  are  there?  The  possible  outcomes  are  listed 

in  Table  1. 

TABLE  1 

13  3A  *  15  @ 

23  24  (25)  26 

33  ©  35  36 

(43)  44  45  46 

53     5^     55  56 

63     64     65  66 

Each  digit  in  Table  1  is  the  number  on  a  die.   The 
circled  values  indicate  the  combinations  that  total  7. 
In  accordance  with  the  definition,  a  probability  of  6/36 
would  be  assigned  to  event  A  =  7  coming  up  on  a  pair  of 
dice.   There  are  36  possible  outcomes  and  6  outcomes 
corresponding  to  A.   This  may  be  written  P(A)  =  6/36. 
Let  B  =  11.   The  probability  of  B  is  2/36.   Let  C  =  11  or 
7.   Then  P(C)  =  8/36. 


11 

12 

21 

22 

31 

32 

41 

42 

51 

@ 

£l\ 

62 

2Ibid.   pp.  36-37 


1-4 


Suppose  the  dice  are  dishonest  and  all  the  l's  are 
replaced  by  2*s.   The  shape  of  the  dice  has  not  been  chang- 
ed so  there  would  be  no  change  to  the  probabilities  assigned 
to  each  face  coming  up  on  a  roll.   This,  is  a  case  of  the 
more  general  type,  one  in  which  the  various  outcomes  of  an 
experiment  are  not  expected  to  occur  with  the  same  relative 
frequency.   In  the  more  general  case,  the  definition  is 
modified: 

The  probability  that  an  event  A  will  occur  is  defined 
as  the  sum  of  the  expected  relative  frequencies  for  the 
possible  outcomes  that  correspond  to  the  occurrence  of  A. 
For  example,  let  A  =  7.  The  student  should  verify 
?(A)  =  2/36  +  1/36  +  I/36  +  2/06  =  1/6 
because  these  numbers  are  the  expected  relative  frequencies 
corresponding  to  the  four  favorable  outcomes,  i.e.,  25, 
3^»  ^3,  and  52.   This  result  is  the  same  as  P(A)  in  the 
first  Illustration.  For  the  combinations  16  and  6l,  has 
been  substituted  the  combinations  25  and  25.   Let  B  =  k. 
The  only  possible  outcome  now  is  22  since  there  are  no 
l's  on  the  dice.  Then 

P(B)  =  V36 
because  there  are  four  ways  of  making  B  with  these  dishonest 
dice.  The  result  is  not  the  same  as  that  for  the  previous 
experiment  of  rolling  two  honest  dice. 


1-5 


This  procedure  of  assigning  expected  relative 
frequencies  to  each  of  the  possible  outcomes  of  an 
experiment,  whether  they  are  assumed  to  be  equal  or 
not,  is  part  of  the  process  of  constructing  a 
theoretical  model  for  the  actual  experiment.   If 
the  assigned  relative  frequencies  are  close  to  the 
relative  frequencies  that  will  be  obtained  when 
the  experiment  is  repeated  a  large  number  of  times, 
then  the  probability  of  an  event  A  calculated  on 
the  basis  of  those  assigned  relative  frequencies 
will  be  close  to  the  relative  frequency  with  which 
the  event  A  will  occur  when  the  experiment  is  re- 
peated a  large  number  of  times.   Thus,  in  rolling 
two  dice,  the  model  that  one  usually  chooses  is  the 
one  in  which  all  36  outcomes  are  assigned  the  same 
relative  frequency.   Since  experience  with  dice  has 
shown  that  this  is  a  realistic  model,  it  will  usually 
be  found  that  the  observed  relative  frequency  of 
getting  a  total  of,  say,  7  joints  will  be  close  to 
the  probability  value  of  1/6  if  a  large  number  of 
rolls  is  made.   It  is  the  statistician's  privilege  to 
assign  any  consistent  relative  frequencies  he  de- 
sires to  the  possible  outcomes  of  his  experiment; 
however,  he  itfill  naturally  attempt  to  make  his  model 
as  realistic  as  possible  so  that  his  resulting  pro- 
babilities of  events  will  be  realistic  and  therefore 
useful  for  solving  practical  problems. 3 

It  should  also  be  noted  that  on  the  basis  of  histor- 
ical data  obtained  in  the  recent  past,  the  statistician 
will  modify  his  original  probability  assignments  to  make 
his  model  more  realistic.  The  more  realistic  the  pro- 
bability model,  the  more  likely  the  conclusions  drawn  from 
it  about  the  future  will  be  realistic. 
1.3  The  Addition  Rule. 

If  A  and  B  are  mutually  exclusive  events,  this  means 
if  A  occurs,  then  B  can  not  occur.   For  example,  if  in 


febid.  p.  38. 
Ibid,  p.  40. 
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the  dice  illustration  a  7  comes  up  in  a  roll,  11  can 
not  come  up  in  the  same  roll.   The  number,  of  outcomes 

corresponding  to  the  occurrence  of  A  +  B  is  equal  to  the 

i 
sum  of  the  number  of  outcomes  corresponding  to  the  occur- 
rence of  A  and  the  number  of  outcomes  corresponding  to 
the  occurrence  of  B.  Then 

P(A  +  B)  =  P(A)  +  P(B). 
For  more  than  two  mutually  exclusive  events  the  formula 
may  be  extended,  e.g., 

P(A  +  B  +  C)  =  P(A)  +  P(B)  +  P(C) 
As  an  illustration  with  two  dice  again  let  A,  B,  and  C, 
be  the  events  of  getting  a  total  of  e:-:actly  10  points, 
11  points,  and  12  points,  respectively.   From  Table  1, 
the  probabilities  of  those  events  occurring  are  given 
by  P(A)  =  3/36,  P(B)  =  2/36,  and  P(C)  =  I/36.  The 
probability  that  at  least  one  of  these  mutually  exclusive 
events  will  occur  is  given  by 

?(A  +  B  +  C)  =  3/36  +  2/36  +  I/36  =  1/6. 
1.^  The  Sum  of  the  Probabilities  of  all  events  is  1.00 

Take  the  case  of  a  two-sided  .  le  or  coin.   The  pro- 
bability of  getting  H(heads)  in  the  first  toss  is  ^. 
The  probability  of  getting  T( tails)  the  first  toss  is  |. 
•|  +  §  as  1.00.   One  can  also  look  at  this  probability 
concept  from  the  point  of  view  of  one  event.   The  P(H) 
i.e.,  (probability  of  getting  heads)  on  the  first  toss  is 
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I,  and  the  probability  of  H  (not  heads)  not  getting  heads 

is  1.00  -  i  ■  i.   Extend  this  line  of  thinking  to  the  six- 

i  • 
sided  die.  The  P(5)  coming  up  on  a  throw  is  1/6,  and  the 

P  (5)  (probability  of  not  j5)  is  1.00  -  1/6  =  5/6. 

1.5  ■  Problems 

1.  Find  the  P  of  drawing  from  a  contract  bridge 

deck  of  cards : 

(a)  An  ace 

(b)  Pour  aces  without  replacement 

( c )  A  spade 

(d)  Not  a  spade 

(e)  Not  an  ace 

(f )  Two  aces  without  replacement 
on  titfo  draws 

(g)  A  red  card  or  a  jack  on  one 
draw 

Answers,   (a)  4/52  (b)  4/52  x  3/51  x  2/50  x  1/49 
(c)  1/4   (d)   3/4    (e)   12/13 
(f)  4/52  x  3/51      (?)  28/52 
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CHAPTER  2 
Expected  Value  and  Utility 
2.1  Illustrative  Definitions 

2.1.1  -Conditional  value-The  value  of  an  act  on  condition 

that  some  event  occurs,  eg:  the  value  to  a  business- 
man of  a  contract  on  condition  that  he  gets  the 
contract;  or  the  value  to  a  businessman  (cost  to 
him)  of  submitting  a  contract  proposal  on  condition 
that  he  does  not  get  the  contract;  also  the  value 
to  a  businessman  of  a  contract  on  condition  that  he 
does  nothing.   For  example,  a  contract  with  a 
$35,000  gross  profit  which  cost  $10,000  to  prepare 
for  would  have  a  $25,000  value  on  condition  of  being 
won,  a  minus  $10,000  value  on  condition  of  being 
not  won  and  a  zero  value  on  condition  of  not  sub- 
mitting a  proposal  at  all. 

2.1.2  Expected  value — A  weighted  average  of  all  the  con- 

ditional values;  the  weights  applied  to  the  con- 
ditional values  are  the  probabilities  that  the 
alternative  events  will  occur,  egt  the  probability 
that  the  businessman  will  get  the  contract  and  the 
probability  that  he  will  npt  get  the  contract.   In 
simplest  terms  it  is  the  old  fashioned  arithmetic 
average • 
2.2  The  expected  monetary  value-of  an  act  oan  be  found  by 
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constructing  a  payoff  table  as  follows: 

Table  2,1  Expected  monetary  Value  of  Making  the  Proposal 

Conditional   Expected 
EVENT  Probability      Value        Value 

Get  contract  .5         $25,000       $12,500 

Do  not  get  contract      .5        -  10,000       -  5t000 

1.0  $  7,500 

The  Act  of  submitting  a  proposal  for  this  $35,000  gross 
profit  contract  has  an  expected  monetary  value  of  $7,500. 

Frequently  expected  value  may  be  negative.   For 
example ,  what  Is  the  expected  value  to  each  player  when 
they  have  an  even  $10  bet  on  the  toss  of  a  coin?   One 
player  has  a  .5  probability  of  winning  $10  and  the  other 
has  a  .5  probability  of  losing  $10.   In  this  case  the  expect- 
ed monetary  value  will  be  zero:  $10(.5)  -  $10(.5)  -  0. 
Suppose  that  for  some  foolish  reason  player  A  feels  sure 
he  will  win  and  is  willing  to  bet  $20  against  player  B*s 
$10;  what  is  the  expected  monetary  value  of  the  toss  to  player 
A?   $10(.5)  -  $20(.5)  -  -  $5.   To  player  B?  -$10  (.5)  + 
$20(.5)  =  $5. 

2.3  In  addition  to  problems  of  a  monetary  nature  the 
concept  of  expected  value  is  also  applicable  in  other  areas 
such  as  Inventory  and  sales  forecasting.   A  military  problem 
of  inventory  control  of  an  electron  tube  for  which  there 
might  be  12,000  gross  requirements  would  have  an  expected 
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value  of  5,800  units  as  follows: 

Table  2.2  Computation  of  an  Expected  Inventory  Requirement 


EVENT 

Probability 

Conditional 
Requirement 

Expected 
Requirement 

Hot  war 

.25 

10,000 

2,500 

Cold  war 

.60 

5,000 

3,000 

Normal  peacetime 

.1? 

2,000 

300 

1.00  5,800 

Conditional  requirements  in  this  case  are  based  on 
the  anticipated  number  of  equipments  in  use  over  a  period 
of  time  for  which  there  is  a  requirement  for  this  particular 
tube,  and  the  probabilities  are  estimates  assigned  to  the 
occurrences  of  hot  war,  cold  war  or  normal  peacetime,  re- 
spectively. 

2.4  The  concept  of  expected  monetary  value  provides  a 
means  by  which  we  can  consider  the  various  consequences  of 
two  or  more  events.   Some  of  these  consequences  may  be 
positive  and  some  may  be  negative,  and  in  summation  they 
tend  to  balance  each  other  somewhat.   Expected  monetary 
value  may  be  likened  to  a  man  standing  with  his  right  foot 
on  a  block  of  ice  and  the  left  one  on  a  warm  stove.   The 
experience  is  neither  all  hot  nor  all  cold,  but  one  or 
the  other  will  likely  predominate.   Looking  at  it  alter- 
natively, if  he  had  ,1ust  come  in  from  the  cold  winter 
weather,  he  might  shift  his  weight  to  his  left  foots  but 
in  a  hot  summer  situation  he  might  shift  to  the  right. 
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Similarly,  if  in  our  previous  example  Player  A  wants  to 
raise  the  game  to  $200  against  player  B's  $100,  player 
3  might  not  be  willing  to  continue  the  game  any  longer 
even  though  the  expected  monetary  value  is  quite  favorable 
to  him — he  cannot  afford  to  lose  $100.   But  Player  A,  being 
a  well-heeled  speculative  individual,  is  willing  to  risk 
his  money  for  a  much  smaller  gain.   This  indicates  that 
not  only  do  individuals  have  different  utilities  for  money, 
but  each  person's  individual  utility  varies  at  different 
levels  ( just  as  the  man  had  differing  utilities  for  the 
ice  and  stove  under  varying  circumstances). 

When  a  businessman  is  faced  with  a  decision  which  has 
one  of  two  possible  consequences  such  as  the  aforementioned 
contract  problem,  expected  monetary  value  may  or  may  not  be 
an  effective  guide  to  action.   If  his  financial  condition 
is  such  that  he  can  afford  to  take  a  $10,000  loss  if  he 
does  not  get  the  contract,  then  he  might  be  quite  willing 
to  accept  that  risk  in  order  to  gain  the  opportunity  of  a 
$25,000  net  profit.   On  the  other  hand,  if  his  financial 
condition  were  marginal,  he  could  ill  afford  the  consequence 
of  losing  $10,000  if  he  did  not  get  the  contract  and  the 
expected  monetary  value  of  the  contract  would  be  zero  to 
him  for  he  would  not  submit  a  proposal  at  all. 

Therefore  in  making  the  decision  on  whether  or  not  to 
use  expected  monetary  value  as  a  guide  to  action,  the 
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businessman  should  reduce  his  problem  to  a  decision  between 
(1)  an  act  which  is  certain  to  result  in  receipt  of  a  certain 
amount  of  cash  and  (2)  an  act  which  will  result  in  either 
the  best  or  the  worst  of  all  oonsequences  of  the  decision 
problem.   In  the  case  illustrated  above  this  would  be  a 
choice  between  (1)  $0  and  (2)  either  a  profit  of  $25,000 
or  a  loss  of  $10,000  for  an  expected  monetary  value  of 
$7»500»   If  the  stakes  are  so  large  that  the  businessman 
does  not  want  to  make  the  decision  in  this  manner,  then 
expected  monetary  value  would  not  be  a  valid  guide  for  his 
action  and  another  decision-making  guide  must  be  found. 
This  is  done  by  the  use  of  individual  utility  curves. 
2,5  Development  of  Utility  Curves 

Utility  is  defined  as  the  power  to  satisfy  human  wants. 
The  objective  of  the  individual  is  held  to  be  maximization 
of  the  total  utility  he  can  achieve  with  his  limited  resources 
of  time,  money,  and  effort.  The  rationality  of  the  individual 

is  defined  in  terms  of  the  utilization  he  makes  of  his  scarce 

1 
resources  to  achieve  this  end  of  maximization  of  utility. 

It  is  derived  from  the  economic  theory  of  diminishing  marginal 

utility.   The  more  you  have  of  a  commodity,  including  money, 

the  less  desireable  is  the  possession  of  one  more  unit  of 

it.   To  own  a  Cadillic  is  most  desireable  and  a  second  one 


David  W.  Miller  and  Martin  K.  Starr,  Executive  Decisions  and 
Operations  Research,  (Englewood  Cliffs,  N.J.:  Prentice 
Hall  Inc.,  1961 )   p.  20. 
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might  be  quite  nice  but  by  the  time  you  got  to  the  third 
one,  its  subjective  value  (utility)  to  you  is  quite  a  bit 
less  than  the  second  and  substantially  less  than  the  first. 
The  same  is  true  of  steaks,  ice  cream,  money,  etc. 

A  utility  curve  is  a  graphic  indication  of  how  an 
individual  will  react  when  faced  with  certain  types  of 
decisions  of  a  monetary  nature.   The  basic  text  gives  a 
detailed  and  thorough  explanation  of  how  these  curves  are 
developed.   The  example  in  the  text  is  reviewed  here  be- 
cause it  provides  a  concise  explanation  of  utility. 

Utility  curves  have  some  limitations: 

1)  A  utility  curve  applies  only  to  one  Individual 
in  a  particular  type  of  situation  and  not  to  all 
types  of  situations  with  which  this  individual 
might  be  faced; 

2)  utilities  cannot  be  added-the  utility  of  one 
consequence  cannot  be  added  to  that  of  another 
to  obtain  the  utility  of  both  together; 

3)  utility  curves  are  not  guides  to  social  action- 
because  one  man  cannot  afford  to  lose  as  much  as 
another  as  indicated  by  a  more  conservative  utility 
curve  is  no  justification  to  tax  the  man  who  can 
afford  to  be  more  risky. 

The  factors  considered  in  deriving  an  individual^ 
utility  curve  reveal  that  any  particular  curve  is  a  subjective 
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Indication  of  that  individual's  behavior  pattern  when  he  is 
faced  with  certain  types  of  decisions-hls  attitude  toward 
risk,  profit  and  loss. 

Figure  2-1  shows  the  utility  curves  of  four  individuals 
who  are  faced  with  decisions  that  can  result  in  a  series 
of  consequences  from  losing  $10,000  to  making  a  profit  of 
$10,000: 
Individual  A — is  probably  a  marginal  operator  who  must 
have  a  very  good  probability  of  making  a 
profit  before  he  is  willing  to  risk  any 
loss  at  all  in  order  to  gain  a  profit.   He 
is  so  conservative  that  he  Is  willing  to 
pay  a  substantial  amount  for  Insurance 
against  a  loss  even  though  the  probability 
of  loss  is  small. 
Individual  B — is  probably  either  a  speculator  or  an  in- 
1       dividual  with  substantial  resources  who  is 

willing  to  risk  much  in  order  to  gain  an  even 
bigger  profit.   Even  though  the  probability 
of  making  a  large  profit  is  small,  he  would 
take  a  chance  on  it  in  preference  to  a  small 
profit  on  a  sure  thing. 
Individual  C — is  somewhat  of  a  moderate  type  and  is  be- 
tween A  and  B  in  his  attitude  toward  risk. 
He  uses  . expected  monetary  value  as  a  guide 
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to  action. 
Individual  D — takes  a  very  conservative  attitude  toward 
risk  when  the  probability  of  profit  is 
small  but  as  the  probability  of  success 
Increases,  he  becomes  more  and  more  willing 
to  take  a  chance. 


1.0  r 

.9 
.8 

.7 
.6 

-  .5 

>» 

4> 

.4 

.3 
.2 
.1 


-p 


1 

hy 

j/^ 

A, 

^B 

/k 

/ 

0 

_______ 

-10   -8    -6   -4   -2    0   +2   +k       +6       +8   +10 
Cash  profit  or  loss  (000  dollars) 

Figure  2.1.  Utility  of  money  for  four  different  individuals 
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Table  2.3.  Cash  equivalent  of  a  $10,000  profit  or  loss 
contract  for  four  different  Individuals. 


Probability 

If 

Cash 

eauiva 

lent   for   Individual 

of 

$10,000  profit            A 

B 

C 

D 

.85 

0 

9,700 

7,000 

9,500 

.75 

-3,000 

9,000 

5,000 

8,500 

.50 

-7,000 

7,000 

0 

0 

.25 

-9,000 

3,000 

-5,000 

-8,500 

.15 

-9,700 

0 

-7,000 

-9,500 

Utility  curves  have  the  following  uses: 

1)  They  can  be  used  to  compute  the  cash  equivalent  of 
a  particular  contract  for  a  businessman. 

2)  They  Indicate  how  much  a  businessman  might  be 

l 

,  willing  to  pay  for  insurance  against  certain 
types  of  losses  . 

3)  They  indicate  the  point  at  which  a  businessman  Is 
indifferent  about  either  accepting  or  rejecting  a 
contract. 

4)  They  provide  a  method  of  analysis  for  determining 
whether  or  not  a  businessman  should  accept  or  re- 
ject certain  types  of  contracts  that  are  offered 
him. 

5)  They  indicate  whether  or  not  expected  monetary 
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value  can  be  used  as  an  effective  guide  to 
action.   (In  the  case  of  Businessman  C,  it 
can. ) 
Problems  on  expected  Monetary  Value 

1.   A  new  furniture  store  in  a  growing  community  is 
considering  purchase  of  a  certain  type  of  wall- 
to-wall  carpeting  for  resale.   Based  on  the 
number  of  new  homes  being  built,  they  have  assign- 
ed the  following  probabilities  to  the  market  for 
wall-to-wall  carpeting  for  the  next  year. 

Sales  in  Sq .  Ft.  Probability 

6,000  .1 

10,000  .2 

20,000  .5 

45,000  .2 

If  the  store  purchases  5,000  sq.  yds.  at  $3.00/yd., 
what  must  the  selling  price  be  in  order  to  realize  a  profit 
of  $3,000? 


Solution: 

Sales  in  Sq 

Ft 

Probability 

Expected  Sales 

•  6,000 

.1 

600 

10,000 

.2 

2,000 

20,000 

.5 

10,000 

45,000 

.2 
1.0 

9.000 
21,600 

2-10 


Gross  Income:  $15,000  +  $3,000  =  $18,000 

Dividing  gross  income  by  expected  number  of  sales, 


18*000  «  $  .83/  sq.  ft.  or  $7.50/sq.  yd. 
21,600 


2.   A  naval  officer  has  ,1ust  inherited  $10,000.   He 
is  a  speculative  type  who  is  considering  "invest- 
ing" his  windfall  in  an  afternoon  at  the  races. 
But  he  is  also  interested  in  a  more  conservative 
long  range  investment.   After  much  careful  study 
of  the  Racing  Form,  he  estimates  that  he  has  about 
a  50-50  chance  of  winning  $8,000  or  losing  $^,000 
on  an  afternoon  of  betting.   He  has  another  opportunity 
to  make  a  $10,000  investment  in  real  estate.   He 
estimates,  that  if  business  conditions  are  good, 
that  within  a  year  the  land  will  be  developed  and 
he  can  make  a  $5,000  profit,  but  if  no  development 
takes  place,  he  will  lose  $5,000  on  a  subsequent 
resale  when  he  is  transferred.   He  estimates  a 
.75  probability  that  the  land  will  be  in  demand 
for  development  and  a  .25  probability  of  no  de- 
velopment.  If  he  loses  at  the  races,  he  cannot 
make  the  real  estate  investment  because  he  has  no 
other  source  of  funds.   If  he  uses  expected  monetary 
value  as  a  guide  to  action,  what  choice  should  he 
make?  What  is  the  expected  monetary  value  of 
each  act? 
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Solution : 

There  are  three  possible  acts  each  with  an  expected 

monetary  value  as  follows! 

Aot  I-An  afternoon  at  the  races: 

Event           Probability  Expected  Monetary  Value 

Win               .5  .5  x  $8,000  =  $4,000 

Lose              .5  .5  x  -4,000  ■  -2,000 

1.0  $2,000 

Act  II-Porego  the  races  and  Invest  In  real  estate: 

Event          Probability  Expected  Monetary  Value 

Area  Is  dev-      .75  .75  x  $5,000  -  $3,750 
eloped 

Area  Is  not       .25  .25  x  -5,000  -  -1,250 

developed       —  »2^00 

Act  III-An  afternoon  at  the  races  and  Invest  In  real  estate: 

Event          Probability  Expected  Nonetary  Value 

Lose  $4,000  .            .5  .5  x  -$4,000  =  -$4,000 

Win  $8,000/    .5  x  .25  -  .125  .125  x  3,000  =     750 
lose  $5,000 

Win  $8,000/    .5  x  .75  -  .375  .375  x  13,000  -   4,875 

gain  $5,000             

1.000  $1,625 

Since  Act  II  has  the  highest  expected  monetary  value,  he 

should  forego  the  races  and  Invest  In  real  estate. 
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3.  An  individual  Is  considering  Investing  $10,000 
In  a  ship-building  company.   If  there  is  peace 
and  increased  international  trade  the  Investment 
will  return  5%   out  if  there  is  a  war  it  will 
return  only  k%   due  to  excess  profits  taxes.   If 
the  investor  assigns  k0%   probability  to  peace  and 
60%   to  war,  what  is  the  conditional  value  of  the 
investment  under  each  condition?  Should  he  make 
an  alternative  investment  that  offers  a  sure  thing 
at  kbfl 

Solution; 

EVENT   Probability  Conditional  Value   Expected  Value 

Peace       .4      $10,000  x  .05  =#500   $500  x  .4  -  $200 

War        .6       10,000  x  .04  ■  400    400  x  .6  ■  240 

$440 

A  sure  thing  at  4^  would  result  in  an  expected  monetary 
value  of  $450.   Since  this  is  better  than  $440,  it  would  be 
the  better  investment  for  an  individual  using  expected 
monetary  value  as  his  decision  criterion. 

4.  An  investor  owns  a  fleet  of  trucks  which  he  has 
leased  to  a  manager-operator  for  a  fee  based  on  a 
share  of  the  profits.  If  successful,  the  present 
value  of  future  rentals  is  estimated  at  $100,000, 
but  if  not  successful,  the  present  value  would  be 
$20,000.   The  manager -operator  has  offered  to  buy 
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the  trucks  outright  from  the  investor  for  $68,000. 
If  the  investor  uses  expected  monetary  value  as 
a  guide  to  action  what  probability  should  be 
assigned  to  success  in  order  for  him  to  be  in- 
different about  selling?   If  he  considers  the 
probability  between  success  and  failure  about 
even,  should  he  sell? 

Solution* 

Event    Conditional  Value   Probability  Expected  Value 

Success   $100,000  P       $100,000  P 

Failure     20,000  1-P        20,000  (1-P) 

1       $  80,000  P  +  $20,000 

The  expected  monetary  value  of  the  sale  is  $68,000. 
If  the  investor  is  to  be  indifferent  between  selling  and 
renting,  the  two  actions  must  have  the  same  expected 
monetary  value, 

$80,000  P  +  $20,000  ■  $68,000 

P  =      .6 

If  he  considers  the  probability  of  success  .  5i  and 
to  be  indifferent  requires  a  probability  of  success  of  .6, 
then  the  probability  of  success  is  not  high  enough  and  he 
should  sell.  Suppose  the  offer  to  purchase  outright  had 
been  $59 , 000,  should  he  sell?  No.   Why  not? 
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Problems  on  Utility 

5.   Assume  that  the  below  data  Is  derived  from  the 
Risky  Investment  Company. 

Utility  TT  Equivalent  Consequence 

0  -  $50,000 

.3  -  40,000 

.4  -   30,000 

.45  -  20,000 
.5  0 

.55  20,000 

.6  30,000 

.7  40,000 

.1  50,000 

a)  Draw  a  utility  curve  and  compute  the  expected 
utility  for  the  following  Investments.  Indicate 
which,  If  any  the  Risky  Company  should  make: 


Investment 

X 

Investment 

Y 

Event 

Probability  Consequence 

Event 

Probability 

Consequence 

P 

.30 

-$40,000 

T 

.10 

+148,000 

Q 

.35 

+  ^5,000 

U 

.50 

+  10,000 

R 

.15 

0 

V 

.40 

-  35,000 

S 

.20 

-  12,000 

b)  How  much  should  the  Risky  Company  be  willing  to 
pay  for  Insurance  against  a  $25,000  loss  If  the 
probability  of  loss  occurring  Is  .25? 

o)  For  what  probability  of  a  profit  would  the 
Risky  Company  be  Indifferent  about  accepting 
an  Investment  on  which  they  could  either  make 
or  lose  $20,000? 
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Answers : 


d)  For  what  probability  of  a  profit  would  they 
be  Indifferent  about  accepting  an  invest- 
ment on  which  they  could  either  make 
$30,000  or  lose  $15,000? 

a)  The  expected  utility  of  investment  X  is 
•539.  investment  Y  is  .^82.   They  should 
therefore  accept  investment  X  but  not  Y, 

b)  $500   . 

c)  .5 

d)  .23 


Note  1  Answers  to  the  above  problems  may  differ  somewhat  due 
to  differences  in  plotting  and  graph  interpretation. 
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CHAPTER  3 

Random  variables  and  Frequency  Distributions 
3.1  Histograms  and  right  and  left  tail  distributions 

An  important  concept  to  get  well  f ixed  in  your  mind 
at  this  point  is  that  of  the  histogram  and  the  right  and 
left  tail  distributions  of  the  random  variable.  Upon 
this  basic  foundation  will  be  built  many  interesting  and 
Important  ideas.   Let's  examine  for  a  moment  what  a  random 
variable  is.   We  shall  define  it  and  then  show  some  examples 
and  variations  of  its  form  and  use.   The  symbol  z  (read 
z-tilde)  is  the  specific  value  that  the  variable  may  take.. 
P(z  =  z)  is  the  short  way  of  saying,  "the  probability 
that  a  certain  elementary  event  obtains  for  which  the 
random  variable  z  takes  the  value  z."   In  the  following 
example,  the  random  variable  is  the  number  of  heads  i.e., 
H,  and  it  can  take  any  value  E,  meaning  0  heads,  1  head, 
2  heads,  or  3  heads. 

To  illustrate  the  use  of  these  symbols,  a  theoretical 
frequency  distribution  is  generated  and  shown  below. 
Take  a  fair  coin  and  toss  it  in  sequences  of  3»   There 
are  eight  possible  outcomes: 

HHH   HHT   HTH   THH   HTT   TKT   TTH   TTT 
32221110 


1 
Harry  V.  Roberts,  Statistics  for  Management . 

(University  of  Chicago,  19 61)   pp.  2-8. 
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Notice  that  below  each  combination  is  described  the  number 
of  heads,  in  each  series.   By  thus  defining  the  number  of 
heads,  we  have  implicitly  defined  the  number  of  tails. 

Since  each  toss  of  a  coin  is  an  independent  event, 
it  can  be  tossed  three  times  or  three  coins  tossed  once 
and  derive  the  same  data.   The  P  of  the  eight  combinations 
are: 

p(h  =  3)  =  Kibirxi!?)  =  1/8 

P(H  =  2)  =  3(ix|rx!)  =  3/8 

P(H  =  1)  =  3(|x|x^)  =  3/8 

P(H  =  0)  =  lUxfel;-)  =  1/8 
Total  P         =1.0 


This  is  a  discrete  distribution  meaning  that  it  is 
impossible  to  get  a  fraction  number  of  heads  i.e.,  l|r  H. 
So,  the  results  may  be  displayed  as  follows: 


in 
11 

*r   3/8 

1/8 


-;- 


0       12 

H 
Fig.    3-la 

Note:   P(H  =  H)   Heads   is   the  random  variable  and 

can  be   either  0,    1,    2,    or   3. 
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1.0    - 

^J 

7/8    ■ 

» 

\i     4/8   - 

• 

*5 

1/8    t 

» 

1 

1.0  * 
7/8 


4/8  - 


1/8  - 


0    12    3 

H 
Left-tail  distribution 

Fig.  3-lb 


1   2    3 

H 
Right-tail  distribut- 
ion 

Fig.  3-lc 


1.0 
7/8 


4/8  - 


1/8 


0 


I 


-^ 


7/8 


4/8- 


1/8 


-t- 


+ 


T 


1.0 


■l « 4 f 


-1  0       i   2   3   4 

H  H 

Left-tail  distribution  Right-tail  distribution 

Fig.  3-lcL  Fig.  3-le 


P(H  ="3)  =1.0  means  the  probability  of  getting  either 
three  heads,  two  heads,  one  head,  or  zero  heads  in  three 
tosses  of  a  coin  is  1.00. 
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P(H  -  0)  =  1.0  means  the  probability  of  getting  0  heads, 
one  head,  two  heads,  or  three  heads  in  three  tosses  of  a 
coin  is  1.00. 

From  the  derivation  of  P(H  =  0,  1,  2,  or  3)  notice 
the  coefficients  1,  3,  3,  1;  these  are  the  coefficients  of 
the  third  series  of  the  binomial  expansion  of  the  form 
(a  +  b)5  =  la5  +  ^a2b  +  Jab2  +  lb  3 
For  any  series  these  coefficients  may  be  developed 
by  Pascal's  triangle  the  first  few  of  which  are  given  in 
fig.  3-^. 

Fig.  3-^ 
Pascal's  Triar.gle 
1      (a  +  b)° 
1     1      (a  +  b)1 
1     2     1     (a  +  b)2 
1     3     3     1     (a  +  b)3 
1     l*  6     4     1     (a  +  b)21 

1     5    10    10     5     1     (a  +  b)5 
1     6    15    20    15     6     1     (a  +  b)6 
1     ?    21    35    35    21     ?     1     (a  +  b)? 
Note:  Each  number  in  the  table  is  the  sum  of  the  two 
numbers  above  on  its  left  and  right. 

Now  consider  the  two-sided  die  (coin)  again  and  take 
(a  +  b)3  expanded;  notice  the  congruency.   From  para- 
graph 3.2  we  have : 


3-* 


1  HHH  +  3HHT  +  3KTT  +  1TTT  .' 

H3  +  3H2T  +  3HT2  +  T3 

o?   +  3a2b  +  3ab2  +  b^ 
The  implicit  probabilities  are: 
P  (a)  =  | 

p  (b)  -  4 

We  may  now  step  into  the  realm  of  the  ten-sided  die 
and  consider  it  binomially.   Take  one  particular  face 
called  A.   What  is  the  probability  of  that  face  coming  up? 

P  (A)  =  1/10.   The  P  (A)  —the  probability 
of  not  A  coming  up — is  9/10. 

We  now  have  an  analogous  situation  compared  with  the 
two-sided  die  except  for  the  implicit  probabilities  associat- 
ed with  a  ten-sided  die.   Symbols  adopted  now  for  the  sake 
of  clarity  are  A  which  represents  one  side  of  the  ten-sided 
die  and  N  which  represents  not  A.   Now  write  this  in 
binomial  form  and  expand  say,  for  five  tosses.  What  are  the 
probabilities  of  each  event? 

(1)  (A  +  N)5 

(2)  The  coefficients  from  Pascal^ 
triangle 

l   5   10   10   5   l 

( 3 )  Then 

AT   +  Sk^1   +  10A3N2  +  10A2n3  + 
5A1N^  +  N^      ■  ! 
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.  1  x  10"5 

^5  x  10"5 

810  x  10"-5 

7290  x  10"5 

32805  x  10"5 

590^9  x  10"5 
1.00000 


(if)  The  probabilities  for  each  event 

follows :  ■ 

(1/10)5  +  5(i/io)i|r(9/io)1  + 
10(1/10)3(9/10)2  I-  10(1/10)2(9/10)3 
+  5(1/10)1(9/10)^  +  (9/10)5 
(5)   Perform  the  indicated  operation 
and: 

=  P(5Afs) 

=  P(^A's) 

=  P(3Afs) 

-  P(2A's) 

=  P(lA»s) 

=  P(0A»s) 

=  Total  P  as  it  should  be. 


We  have  just  developed  a  model  that  can  be  used 
throughout  the  remainder  of  Schlaifer's  textbook  and 
will  be  referred  to  occasionally  as  we  proceed  through 
the  remainder  c.s  this  syllabus. 

To  recap  briefly,  the  concept  of  a  multisided  die 
from  one  with  two  sides  to  one  of  an  infinite  number  of 
sides  has  been  introduced.   The  relationship  of  probability 
to  number  of  sides  has  been  shown.   The  binomial  ex- 
pansion has  been  related  to  the  possible  outcomes  of  a 
series  of  tosses  and  the  mathematical  probabilities 
Inherent  therein. 
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Problems 

1.  Develop  and  plot  the  right  tail  cumulative 
binomial  distribution  for  a  series  of  five 
tosses  of  a  fifty-sided  fair  die. 

2.  An  urn  contains  5  black  and  10  white  balls. 
What  is  (a)  the  P(B)?   (b)   The  P(B)  on  two 
successive  draws  with  replacement  after  .each 
draw?   (c)   What  is  P(2B)  without  a  replace- 
ment?  (d)   The  P(B)  followed  by  white  with 
replacement?   (e)   The  P(W)  followed  by  black 
without  replacement? 

Answers. 

2(a)  5A5  (b)  1/9  (c)   5/15  x  4/14  (d)  2/9 
(e)  5/21 
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CHAPTER  4 
Opportunity  Loss  and  Cost  of  Uncertainty 
4.1.  Tables  and  Definitions 

4.1.1  Pay  Off  Tables 

A  payoff  table  indicates  a  conditional  profit  (or 
negative  profit)  for  every  act  given  various  events.  Each 
cell  in  the  table  is  a  net  amount  of  gain  or  loss  for  each 
act-event  combination. 

To  construct  a  pay-off  table: 

1)  List  every  act-event  combination  with  the  acts 
across  the  top  and  the  events  down  the  side  or 
vice  versa. 

2)  For  each  act-event  combination  (each  cell  in  the 
table)  subtract  the  outflows  from  the  inflows  to 
get  the  net  amount. 

3)  Minus  signs  must  be  preserved  and  indicate  negative 
profit. 

4.1.2  Opportunity  Loss-Is  the  difference  between  the  profit 
actually  realized  in  a  situation  and  the  profit  which 
would  have  been  realized  if  the  best  possible  decision 
had  been  made  for  the  event  which  actually  occurred. 
An  opportunity  loss  can  occur  even  though  a  profit 

is  made. 

4.1.3  Loss  Tables -a  loss  table  Indicates  the  conditional 
loss  for  every  act  given  various  events.   It  can  be 
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constructed  directly  in  a  manner  similar  to  pay- 
off tables  or  derived  from  a  payoff  table.   To 
construct  a  loss  table  from  a  payoff  table : 

1)  Indicate  the  greatest  possible  profit  for  each  . 
event  by  some  notation  such  as  a  star  or  circle 
around  the  largest  profit  in  each  row  of  the 
payoff  table. 

2 )  To  obtain  each  entry  in  the  loss  table  subtract 
the  corresponding  entry  in  the  payoff  table  from 
the  starred  or  circled  entry  in  its  row. 

3)  Note  algebraic  signs  in  the  payoff  table  and 

that  all  cells  in  the  loss  table  will  be  positive. 

4.1.4  Expected  profits  and  losses 

1)  The  expected  profits  and  losses  from  all  acts  can 
be  found  by  taking  a  weighted  average  of  each  act 
(column)  in  the  payoff  and  loss  tables.   The  weights 
are  probabilities  assigned  to  each  event. 

2)  The  least  expected  loss  and  largest  expected  pro- 
fit will  both  occur  with  the  same  act. 

3)  The  difference  between  expected  profits  and  losses 
of  any  two  acts  is  equal,  the  profit  being  less 
and  the  cost  being  more  by  the  same  amount. 

4.1.5  Cost  of  Uncertalnty-The  expected  loss  of  the  best 
decision. 

4.1.6  Cost  of  Irratlonallty-The  amount  by  which  the  expected 
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loss  of  a  chosen  decision  exceeds  that  of  the  best 
decision. 
Problems 

1.  The  Navy  buys  an  electron  tube  for  $7.50.   It 
costs  the  Navy  $1.50  to  pack  and  ship  each  tube 
issued.   If  a  tube  is  not  issued  within  one  year, 
it  no  longer  meets  Navy  specifications  and  must 
be  disposed  of. 

(a)   If  unissued  tubes  can  be  sold  to  commercial 
users  for  $5.00,  how  many  tubes  should  the 
Navy  stock  at  the  beginning  of  each  fiscal 
year?  Assume  that  the  Electronic  Supply 
Office  is  operating  under  the  Industrial 
Fund  and  makes  .$.25  on  each  tube  issued. 
The  following  experience  data  is  available 
on  the  probability  of  demand. 

Demand  (in  1. OOP's)       Probability 
25         '  .03 

30  .05 

35  .15 

JfO  .60 

^5  .10 

50  .07 
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Using  expected  profit  as  a  decision  criterion,  the 
Electronic  Supply  office  should  stock  35,000  tubes. 

b.  What  is  the  cost  of  Irrationality  If  the  de- 
cision Is  made  to  stock  40,000? 

(7.2375  -  5.3250)  x  1000  -  $1912.50 

c.  What  Is  the  cost  of  uncertainty? 


Solution: 


Construct  a  loss  table  and  compute  expected 
losses.   Short-cut:   Since  the  cost  of  un- 
certainty Is  the  expected  loss  of  the  best 
decision,  and  the  best  decision  Is  to  stock 
35 » 000,  then  expected  loss  need  be  computed 
for  this  stocking  level  only. 
Answer:  $2637.50 
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CHAPTER  5 
Incremental  Analysis 
5.1  The  incremental  analysis  of  the  problem  stated  on, 
p.  68  of  Schlaifer  is  given  below  on  the' basis  of  increm- 
ental profit. 

Table  5.1  (From  Schlaifer) 


z 

0 
1 
2 
3 

5 
6 


P(z) 

.05 

.10 

.25 
.30 
.20 
.10 
.00 


P(2  ^  z) 

.00 
.05 
.15 
.^0 

.70 

.90 

1.00 


PAYOFF  TABLES 
Stock  1 
P(z  '   1)  =  .05 
PCS'  ^  1)  =  .95 


Cond.  Value 
Stock 

-$2 

1 


Do  Not 
Stock 

$0 

0 


P(z  ^  z) 

1.00  - 

.95 
.85 
.60 
.30 
.10 
.00 


Incremental 

Expected 

Profit 

-.10 
.95 


$.85 

Stock  2 

Cond.  Value 
Stock 

Do  Mot 
Stock 

Incremental 

Expected 

Profit 

P(z  *  2)  "-  .15 

-$2 

0 

-$.30 

P(£  *.2)  -  .85 

1 

0 

.85 

$.55 
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From  our  previous  discussion  we  know  that  the  probabil- 
ities of  all  possible  events  must  add  up  to  1.00.   Careful 
examination, of  table  5-1  reveals  the  same  and  is  stated 
here: 

that     P(z  ^  j)  =  1.00  -  P(z  ^  .1)  (2) 

Substitute  in  equation  (1): 

kp[l   -  P(£  *  .1)]  ^  knP(2  ^  j)  (3) 


then 


kp  -  kpP<2  *  J)  ^  knP(£  *  J)  w 


kp^  P(£  *   ,1)(kp  +  kn)  (5) 

and  finally: 

P('z  M)^  &—  (6) 

kp  +  kn 

Now  all  that  we  have  to  do  to  determine  the  best 
number  to  stock  is  compute  the  cumulative  probabilities 
for  z9  ,  demand  less  than  the  jth  unit^  and  the  critical 
ratio  and  compare.  Select  the  largest  j  where  the  probabil- 
ity of  it,  is  just  less  than  the  critical  ratio. 
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Then 
J 

$i*<2 

^ 

J) 

-   fP2P(z'  -^ 

.1)   ■ 

Expected 

Incremental 

Profit 

Cumulative 

Sum  2__ 

1 

•  95' 

-  .10 

.85 

•   .  .85 

2 

.85 

-  .30 

,55 

1.40  * 

3 

.60 

-  .80 

-  .20 

1.20 

4 

.30 

-1.40 

-1.10 

.10 

5 

.10 

-1.80 

-1.70 

-1.60 

6 

.00 

-2.00 

-2.00 

-3.60 

*  The  best  decision  is  to  stock  2. 

Selection  of  the  best  number  to  stock  is  based  on  the 
j  which  gives  the  greatest  total  e:-:oected  profit.   When  the 
total  expected  profit  is  diminished  oy  stocking  one  more, 
that  one  is  one  too  many.   Let*s  find  an  easy  way  to  make 
this  determination: 

Let  kD  =  the  positive  incremental  profit  resulting 
from  stocking  the  jth  unit  if  sold. 
kn  =  the  negative  incremental  profit  resulting 

from  stocking  the  ,1th  unit  if  not  sold. 
We  want  the  product  obtained  by  multiplying  the  incre- 
mental profit  by  the  cumulative  probability  of  demand  for 
the  Jth  unit  to  be  greater  than  the  product  obtained  by  mult- 
iplying the  negative  profit  by  the  cumulative  probability 
of  no  demand  for  the  jth  unit  and  express  this  relationship 
as  follows: 

kpP(£  -  J)  *  *nP(z  '  1)  (1) 
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5.3  Problems 

1.   A  retailer  sells  an  item  for  $4..  The  cost  of  the 

t 

item  is  $1.50.  Each  item  sold  is  delivered  to  the  customer^ 
the  cost  of  delivery  being  $.50  per  item."  If  an  item  is 
not  sold  at  the  end  of  the  day,,  it  is  disposed  of  at  a 
salvage  value  of  $.30.   The  retailer  assigns  probabilities 
to  demand  for  the  item  in  accordance  with  the  following 
schedule : 

Demand  Probability 

less  than  5  0 

6  .05 

7  .10 

8  .12 

9  -0 

10  .20 

11  .15 

12  .10 

13  .06 

14  .02 

a.  Determine  the  optimum  stoeving  level. 

b.  At  the  optimum  level,  compute  the  expected  profit 
and  expected  loss. 

c.  What  is  the  cost  of  uncertainty? 
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2.     As  the  junior  officer  in  the  crew  of  a  Navy  trans- 
port aircraft,  you  are  responsible  for  ordering  an  adequate 
supply  of  flight  rations  for  the  crew  and  passengers  for 
each  flight.   Your  next  flight  is  a  "R  &  R"  run  from  Ugga 
Buggs  Airport  to  Hong  Kong,  transporting  military  personnel 
on  leave  from  the  ships  tied  up  in  the  Ugga  harbor. 

The  Ugga  Bugga  municipal  laws  require  that  all 
flight  rations  for  aircraft  operating  from  the  Ugga  Bugga 
Airport  be  purchased  from  the  Airport  Cafe  (operated  by  the 
Mayor's  brother).   The  Cafe  has  established  the  following 
very  firm  rules  on  box  lunch  orders: 

1.  Box  lunches  will  be  ordered  and  paid  for 

48  hours  before  departure  time,  (price  $1.50 
each. ) 

2.  If  your  order  is  larger  than  what  you 
actually  need  at  departure  time  (based  on 
actual  number  of  crew  and  passengers  aboard) 
they  will  refund  $  .50  to  you  for  each  lunch 
not  required. 

3.  If  you  need  additional  lunches  (your  order 
was  less  than  the  number  of  passengers  and 
crew  aboard)  they  can  be  purchased  at  the 
cafe  for  $3*00  each  right  up  to  departure 
time. 

A  firm  list  of  the  total  number  of  crew  and  passengers 
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for  the  flight  will  not  be  available  until,  6  hours  before 
the  departure  time.   Each  passenger  and  crew  member  aboard 
will  pay  you  $1.50  for  their  box  lunch;  however,  any  extra 
cost  caused  by  excessive  or  late  ordering  will  come  out  of 
your  pocket. 

After  considerable  discussion  among  the  officers  of 
the  crew,  the  following  probabilities  of  aircraft  load 
(passengers  and  crew)  x^rere  estimated  based  on  the  number  of 
ships  in  port,  how  long  it  has  been  since  pay  day,  etc. 


Load 

Prob. 

Load 

Prob . 

Load 

Prob. 

less  than  75 

0 

81 

.07 

~W" 

.06 

75 

.01 

82 

.08 

89 

.05 

76 

.02 

83 

.09 

90 

.04 

77 

.03 

84 

.10 

91 

.03 

78 

.04 

85 

.09 

92 

.02 

79 

.05 

86 

.08 

93 

.01 

80 

.06 

87 

.07more 

than 

93 

.00 

How  many  box  lunches  should  you  order? 
3.   The  U.  S.  NAVY  owns  3  Polaris  submarines.   (As  can 
readily  be  seen,  the  data  in  this  problem  is  unclassified 
and  may  be  revealed  to  the  general  public.)   The  Navy  also 
has  available  only  24  skilled  technicians  to  repair  these 
three  subs  (which  are  constantly  in  need  of  repair).   The 
subs,  because  of  their  important  mission,  can  be  in  port 
only  8  hours  each  day  (the  home  ports  are  not  the  same  for 
the  three  subs).   When  the  daily  repairs  are  such  that 
the  Navy  technicians  cannot  complete  them  within  eight 
hours,  (this  can  be  determined  at  the  beginning  of  the 
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work-period)  additional  civilian  technicians  areuutilized. 

The  excess  cost  per  day  per  civilian  technician  amounts 

to  $10  more  than  the  out-of-pocket  cost  of  employing  one 

of  the  Navy  technicians.   The  total  number  of  technicians 

(Navy  or  Navy  and  Civilian  as  the  case  may  be)  required 

for  the  past  100  days  is  shown  in  the  table  below. 

Number  of 
Technicians  required 

0 
1 
2 

3 
k 

5 
6 

7 

8 

9 
10 
11 
12 

100      100      100 

a.  Compute  the  optimum  assignment  of  the  Navy  technicians 
to  the  three  subs. 

b.  Under  this  optimum  assignment ,  what  is  the  daily 
expected  excess  cost  of  hiring  civilian  technicians  at 
each  sub. 


Number 

of  Days 

Sub  A 

Sub  B 

Sub  C 

0 

0 

k 

2 

0 

k 

6 

2 

6 

10 

k 

10 

10 

k 

12 

12 

8 

15 

12 

10 

15 

10 

12 

10 

3 

12 

12 

8 

15 

3 

7 

15 

3 

8 

10 

3 

7 

8 

3 
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CHAPTER  6 

Measures  of  Location: 
Fractlles  and  Expectations j 
Linear  Profits  and  Costs 

6.1.  Fractlles 

6.1.1  A  fractlle  Is  a  measure  of  location  In  a  distribution. 

The  "point  f"  fractlle  Is  the  value  In  a  distribution  that 

Is  both: 

1)  Equal  to  or  greater  than  a  fraction  (.f)  of  the 
values  In  the  distribution  (see  a  below)  and 

2)  Equal  to  or  less  than  a  fraction  (l-.f)  of  the 
values  In  the  distribution  (see  b  below) 

Additionally: 

1)  The  same  value  in  a  distribution  can  be  Included 
in  both  the  lower  (.f)  part  and  the  upper  (l-.f) 
part  (see  c  below)  or 

2)  A  fractlle  can  Include  a  range  of  values  in  a 
distribution  (see  d  below) 

To  illustrate  the  above  with  a  graphical  example  assume 
the  following  distribution  of  Lieutenants  in  the  Navy  by- 
age  group:  F    =25 
.3 

-  =  25    a)  This  means  that  .3  of  the  distribution  of 

Lieutenants  in  the  Navy  by  age  group  are  26 
years  old  or  less.  To  state  it  another  way,  this 
fractlle  (.3)  is  equal  to  or  greater  than  the 
fraction  (.3)  of  the  values  (23,  2kt   25,  and  26) 
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In  the  distribution. 

b)  This  also  means  that  .3  of  the  distribution 
inoludes  values  that  are  equal  to  or  less 
than  a  fraction  (l-.f)  or  .7  of  the  values 
in  the  distribution.   This  is  obviously  true 
in  the  example  since  the  number  of  Lieutenants 
who  are  26  years  old  or  less  is  less  than  the 
number  who  are  28  years  old  or  less. 

This  analogy  makes  more  sense  when  applied 
to  the  median  which  is  a  special  case  of  a 
fractile;  namely,  the  .5  fractile 

.45  (lower)  =  F.55   (upper)  =  2? 

c)  This  is  an  example  of  the  fact  that  the  same 

value  (27)  in  a  distribution  can  be  included 
in  both  the  lower  part  (,f)  and  the  upper  part 
(l-.f)  of  a  distribution.   Therefore  in  our 
example  we  may  think  of  some  of  the  Lieutenants 
in  the  Navy  who  are  27  years  old  or  less  as 
belonging  both  in  the  lower  .**5  of  the  dis- 
tribution and  in  the  upper  ,S5   of  the  dis- 
tribution. 
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Table  6,1  Hypothetical  Distribution  of  Lieutenants  In 
the  Navy  by  Age  Groups 


Age  Group  Z 


Number  In 
Group 


Relative 
Frequency 


Cumulative 
Frequency  of 
Z  or  less 


<  23 

23 

24 

25 
26 

27 
28 

29 
30 

31 
>  31 


1.0 

.9 
.8 

.7 
.6 
.5 
.4 

.3 

.2 

.1 

0 


O 

N 

O 

>> 

o 
C 
<u 

d> 

a) 
> 

-P 

as 
H 


0 
750 
750 
1,500 
2,250 
3,750 
3,000 

1,500 
750 
750 

0_ 

15,000 


0 

.05 
.05 

.10 

.15 

.25 

.20 
.10 
.05 
.05 
0 


1.00 


■ 

•v 

-r" 

7 

^C 

'1 

J 

- 



-   > 

• 

1 

2' 

3   24 

1-   2 

5   2< 

>   2? 

'      2 

8  2« 

?     ; 

10  3 

1  32 

0 

.05 

.10 

.20 

.35 
.60 
.80 
.90 

.95 
1.00 


Age  (Z)  of  Lieutenants  in  the  Navy 

Figure  6.1 
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d)  This  is  an  example  of  the  fact  that  a 
F    =  28-29  inc. 

.8  fractile  can  include  a  range  of  values 

in  a  distribution  and  occurs  in  the 
"flat"  areas  of  the  graph  rather  than 
the  vertical  areas.   In  contrast  to 
(c)  above  it  means  that  all  the 
Lieutenants  28  years  old  or  less  be- 
long in  the  lower  .8  whereas  all  those 
of  29  or  more  years  belong  in  the  upper 
.2  of  the  distribution. 
6.1.2  Fractiles  are  also  sometimes  expressed  as  quart iles. 
deciles  or  percentiles.   This  is  the  result  of  dividing  the 
total  frequency  into  equal  parts;  four,  ten  or  one  hundred, 
respectively.  Any  two  or  more  such  positional  measures  can 
reproduce  the  frequency  distribution.   Thus,  k0%   of  the 
observations  lie  between  the  median  and  the  ninetieth  centile. 
The  second  quart lie  is  the  same  as  the  median.   Note  the 
difference  between  the  first  quarter,  say,  and  the  first 
quartile;  the  first  quarter  is  a  group  of  observations  or 
a  range  of  the  variable,  while  the  first  quartile  is  the 
boundary  point  between  the  first  and  second  quarters.   An 
observation  can  be  within  the  highest  tenth,  but  not  within 
the  highest  decile.   The  express ion- "in  the  Xth  centile"- 
is  common  but  Inaccurate. 


^llen  W.  Wallis  and  Harry  V.  Roberts,  Statistics;  a 
New  Approach.  (New  York,  The  Free  Press  of  Glencoe  Inc., 
1952 )  p .  2^9 

6-4 


6.1.3  The  median,  or  .5  fractile,  is  the  middle  observation. 
But  to  be  precise,  a  certain  amount  of  hair-splitting  is 
necessary.   It  is  a  number  in  a  set  of  observations  that 
neither  exceeds  nor  is  exceeded  by  more  than  half  the 
observations.   When  there  is  an  odd  number  of  observations, 
the  median  is  the  middle  observation;  when  there  is  an  even 
number  of  observations^  it  can  be  any  number  at  or  between 

the  two  middle  observations.  Thus,  it  may  be  a  value  not  in 

2 
any  of  the  observations. 

6.1.^  The  arithmetic  mean  is  simply  an  average  and  it  can  be 

found  in  three  ways: 

1)  The  most  common  way  is  to  add  the  set  of  values 
and  divide  by  the  number  of  values  comprising 
the  set. 

2)  When  the  values  of  a  random  variable  occur  re- 
petitively in  a  distribution,  the  arithmetic  mean 
can  be  found  by  multiplying  each  value  by  the 
number  of  individual  occurrences,  summing  the 
products  and  dividing  the  sum  by  the  total 
number  of  occurrences.   The  number  of  individual 
occurrences  may  be  considered  as  weights  and  the 
arithmetic  mean  as  a  weighted  average. 

3)  The  number  of  occurrences  (or  weights)  can  be 
converted  to  relative  frequencies  (whose  sum  is  1), 

2Ibld.  P.  216 
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multiplied  by  the  corresponding  value  of  the 
random  variable  and  summed  to  get  the  mean. 

6.1.5  The  mode  is  the  value  in  a  set  of  observations  that 
ocours  most  often.   Of  the  three  measures  of  location  in 
a  frequency  distribution,  the  mean  and  the  median  are  the 
principal  ones,  but  the  mode  also  has  its  useful  purposes. 

6.1.6  Graphical  examples  of  the  above  definitions  are 
illustrated  in  the  following  diagram  with  thumbnail  de-  • 
finitions: 


o 

0) 

& 

4) 

U 


Variable  (Z) 
Figure  6.2 
The  distribution  of  the  random  variable  (Z)  has  the 
following  locations: 
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A-the  minimum  value 

B-mode,  the  value  of  the  variable  (Z)  which 

has  the  greatest  number  of  observations 
C-median,  the  value  of  the  variable  which 

exceeds  half  of  the  observations  and  is 

exceeded  by  half 
D-mean,  the  value  ordinarily  referred  to  as 

"average" 
E-the  ninetieth  centile,  the  value  of  the 

random  variable  (Z)  which  exceeds  90  percent 

of  the  observations  and  is  exceeded  by  10 

percent. 

3 
F-the  maximan  value 

6.2  Linear  Profits  and  Costs 

6.2.1  When  we  compute  the  mean  of  a  probability  distribution 
of  a  random  variable,  we  are  computing  the  expected  value  of 
the  random  variable. 

In  many  cases  profit  (or  cost)  is  Just  a  special  case  of 
a  random  variable  and  is  the  mean  of  a  probability  distribution 
of  conditional  profits  (or  costs).  This  can  be  expressed  in 
the  form: 

Conditional  profit  or  cost  =  K  +  k  E  (  Z), 


3Ibld.  pp.  211-212 
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Where  K  and  k  are  constants  and  E  (Z)  is  the  expected 
value  or  mean  of  the  distribution  of  a  random  variable  Z. 
Problems 

1,   The  Navy  formerly  ran  its  own  coffee  roasting  and 
distribution  plant.   Assume  that  the  table  below  is 
the  monthly  demand  of  the  various  ships  in  the  Navy 
for  coffee.   Also  assume  that  the  plant  pays  $1.00 
per  pound  for  green  coffee,  that  grinding  and  dis- 
tribution costs  are  $.25  per  pound  sold  and  that  a 
profit  of  $.25  per  pound  is  authorized  as  a  con- 
tribution to  the  WAVES  Retirement  Fund.   If  the  Navy 
plant  runs  short,  the  demand  is  met  by  purchasing 
coffee  from  commercial  sources  at  $1.50  per  pound 
including  shipping  costs  to  the  requisitioning  activity. 
Demand  in  lbs   No.  of  ships   Demand  In  lbs   No.  of  ships 


5° 

21 

110 

114 

60 

51 

120 

69 

70 

53 

130 

*5 

80 

81 

140 

21 

90 

120 

150 

15 

100 

150 

a)  Considering  the  above  data  as  a  probability  distribution 
rather  than  a  frequency  distribution,  graph  P(Z  ^  Z) 
against  the  demand  Z, 
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b)  Compute  the  deciles  (P     F    •  •  .F    ). 

.1,    .2        1.0 

What  Is  the  value  F   ?  Answer :  110  to  120  lbs,  per 

.8 
ship. 

o)  What  Is  the  expected  monthly  demand  per  ship? 

Answer:  96.88  lbs. 

d)  In  anticipation  of  reduced  demand  for  oof fee  during 
the  summer  months  the  plant  maintains  a  total  stock 
of  60,000  lbs.   What  is  the  monthly  expected  loss  to 
the  Retirement  Fund  if  demand  falls  to  an  average  of 
85  lbs.  per  ship? 

Answer:  $937.50 

e)  If  ordering,  receiving  and  processing  green  coffee 
from  Brazil  takes  one  month  half  the  time,  two  months 
a  third  of  the  time,  and  three  months  the  remainder  of 
the  time,  how  much  coffee  should  be  maintained  in 
stock  to  meet  the  normal  expected  demand? 

Answer:  121,000  lbs. 

f )  Assume  that  there  is  no  longer  a  requirement  to  con- 
tribute to  the  WAVE  Retirement  Fund  but  the  plant  is 
still  authorized  to  make  $.25  per  pound  profit  and 
reduce  inventory  by  using  the  proceeds  to  purchase 
coffee  through  commercial  sources.   How  much  coffee 
should  the  Navy  plant  process  and  how  much  should  be 
obtained  commercially  In  order  to  Just  break  even? 
Answer:  62,281  lbs.  and  10,379  lbs.  respectively. 
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g)  Assume  that  coffee  not  sold  to  a  ship  by  the  end 
of  the  month  is  no  longer  consumable  and  must  be 
surveyed  at  a  loss  of  $1.00  per  pound,  its  purchase 
price.   If  the  plant  manager  uses  the  frequency 
distribution  in  the  above  table,  how  many  pounds 
per  ship  per  month  should  be  stocked? 
Answer:  80  lbs. 
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CHAPTER  7 

Assessment  of  -probabilities  by  historical  smoothing. 
I 

7.1  Relating;  probabilities  to  relative  frequencies. 

When  occurrence  of  a  number  of  events  are  converted 
Into  relative  frequencies,  it  is  possible  to  assess  pro- 
abilities  to  each  of  these  events  based  on  this  historical 
data.   The  historical  data  however,  may  generate  a  distrib- 
ution that  appears  irregular  and  is  difficult  to  manipulate 
On  such  occasions,  one  might  wish  to  adjust  the  assigned 
probabilities  to  approximate  a  smooth  curve  rather  than 
assess  probabilities  from  frequencies  derived  from  the  raw 
data  provided  that  there  are  no  assignable  causes  for  the 
variations. 

For  example,  take  the  following  106  time  lapses  (in 
minutes)  between  eruptions  of  Old  Faithful  Geyser  in 
Yellowstone  National  Park.   (they  were  recorded  between 
the  hours  of  3  am  and  10  pm  on  July  6-  July  1^,  1955  >  and 

they  were  measured  from  the  beginning  of  one  eruption  to 

1 
the  beginning  of  the  next). 


1 
John  E.  Freund,  and  Frank  J.  Williams,  Modern 
Business  Statistics ,  p.  31. 
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Table  7-1 

\  ,  / 

60  62  68  72  50  76  59  69  62  70  67  44  71 

63  62  65     69  62  70  40  62  74  59  68  66  68 

66     55     60  65  63  57  66  68  67  58  74  74  66 

63  62  61  66  67  67  7^  76  63  40  63  71  67 

61  67  71  64  52  76  63  71  76  60  74  41  72 
44  73  60  69  55  81  55  68  51  67  62  66  66 
60     72  68  59  69  55  58  60  69  58  66  63  68 

62  55  67  67  64  45  65  60  64  67  71  72  72 
41  67 


Table  7-2  is  the  data  rearranged  by  frequency  (f )  vs. 
time  intervals  (i)  and  is  plotted  in  figure  7-1. 

From  this  data,  one  could  assign  probabilities  to 
the  occurrence  of  each  event.  However,  some  events  would 
be  assigned  a  probability  of  zero  for  example,  P(i  =  47)  =  0* 
This  quite  obviously  is  not  a  very  good  assessment  of  that 
probability.   Suppose  this  date  is  reclassified  into  5 
minute  intervals.  Table  7-3  is  generated  on  this  premise. 
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Table  7-2 
Data  from  Old  Faithful  classified  in  1  minute  intervals. 


i     f 

i 

f 

1 

f 

i 

f 

40    1 

55 

5 

63 

6 

70 

3 

41    1 

57 

1 

64 

3 

71 

6 

44    2 

58 

3 

65 

4 

72 

5 

45    1 

59 

3 

66 

8 

73 

1 

50    1 

60 

6 

67    11 

74 

5 

51    1 

61 

2 

68 

8 

76 

4 

52    1 

62 

8 

Table 

69 
7-3 

5 

81 

1 

Data 

from  Old  Faithful  classified 

in 

5  minute 

intervals . 

Table 

1 

f 

p(T«  t 

L) 

P(l^i) 

40-44 

4 

.0378 

1, 

,000 

45-49 

1 

.0094 

1 

,9622 

50-54 

3 

.0283 

< 

.9528 

55-59 

12 

.1133 

1 

.9245 

60-64 

25 

„?:so 

■ 

,8112 

65-69 

36 

.3390 

a 

.5752 

70-74 

20 

.1890 

4 

,2362 

75-49 

4 

.0378 

1 

,0472 

80-84 

1 

.0094 

■ 

,0094 

To3  1.0000 
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Fig.  7-2 
P(i  =  1)  vs.  class  interval  of  5  minutes. 
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In  figure  7-2,  it  is  appa_  ...   iliat  the  distribution 
has  smoothed  out  considerably,  but  there  is  still  some  dis- 
parity at  the  left  end  of  the  distribution.   By  manipulating 
the  data,  some  more  irregularities  may  be  removed  such  as 
the  bimodal  aspect,  simply  by  changing  the  interval  and/or 
where  each  class  begins.   Suppose  that  the  data  is  reclass- 
ified into  ten  minute  intervals.   Table  7-4  is  generated  on 
this  premise. 
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Table  7-4 
Data  from  Old  Faithful  classified  in  ten  minute  intervals 


i 

f 

P(i) 

P(i  =  1) 

31-40 

1 

.0094 

1.0000 

41-50 

5 

.0472 

.9906 

51-60 

20 

.1885 

.9W 

61-70 

58 

.5^75 

.75^9 

71-80 

21 

.1980 

.2074 

81-90 

1 

.0094 

.0094 

106 


1 .  COO' 


Fig.  7-3 
P(i  =  i)  vs.  class  interval  of  ten  minutes. 
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In  figure  7-3  is  P(i  -  i)  plotted  against  a  class 
interval  of  10  minutes  derived  from  table  7-^.  Notice 
that  classification  intervals  can  begin  and  end  at  any- 
selected  point  on  the  x-axis.   By  merely  changing  these 

points  it  is  possible  to  make  profound  changes  in  a 
distribution  depending  on  the  diversity  of  data  and 
classification  intervals  selected.   It  should  also  be 
emphasized  that  although  tables  7-3  and  7-^  show  pro- 
babilities to  four  places,  they  are  not  significant  to 

four  places.   Additional  decimal  places  have  been  arbitrar- 
ily filled  in  beyond  the  significant  digits  to  complement 
all  probabilities  to  a  sum  of  1.000C. 

Figure  7-^  ^nd  7-5  are  plotted  on  normal  probability 
paper.  "Figure  7-b   is  a  plot  of  the  P(i  ^  i)  based  on  a 
classification  interval  of  5  minutes  and  figure  7-5  is  a 
plot  of  the  P(i  ^  1)  based  on  a  classification  interval 
of  10  minutes.   As  can  be  seen  in  the  illustrations,  proper 
selection  of  classification  interval  and  the  starting 
point  are  primary  factors  in  the  ;1_  termination  of  a  smooth 
curve  derived  from  empirical  data. 
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CHAPTER  8 
Conditional,  Joint ,  and  Marginal  Probability 
8.1  Theory  of  Probability 

8.1.1  The  theory  of  probability  Is  a  branch  of  mathematics 
and  one  of  the  applications  of  this  theory  is  statistics. 
Probability,  chances  and  likelihood  are  words  that  are  common 
in  everyday  conversation — the  probability  that  it  will  rain 
tomMrrow,  the  chances  that  the  Yankees  will  win  the  Pennant 
this  year,  the  likelihood  that  there  is  life  on  Mars,  etc. 
However,  in  order  to  be  useful,  mathematical  probability 
should  be  expressed  quantitatively.   Probabilities  used  in 
business  may  be  based  on  past  experience,  a  collection  of 
data,  sampling,  or  an  estimate  of  the  future.   In  the 
application  of  probability  theory  to  business  and  industry, 
it  is  important  not  to  confuse  probability  with  frequency 
even  though  the  two  can  often  be  equated  (see  section  3«2 

of  Schlaifer). 

8.1.2  For  our  purposes,  probability  can  be  broken  down  into 
three  concepts:  joint,  conditional  and  marginal.   Basically: 

a)  Joint  probability  is  the  probability  that  two  or  more 
events  will  all  occur;  each  is  Independent  of  the  other. 

b)  Conditional  probability  is  the  probability  that  an  event 
will  occur  on  condition  that  another  event  has  already 
occurred  or  would  occur;  the  second  event  is  dependent 
on  the  first;  this  definition  also  holds  true  for  more 
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than  two  events, 
c)  Marginal  probability  of  an  event  is  the  sum  of  the 

joint  probabilities  making  up  the  event. 
8.1.3  The  above  definitions  can  be  illustrated  by  the  follow- 
ing example: 

Table  8-1  Age  and  Sex  of  Employees  of  Corporation 

Age  Male  Bi   Male  B2     Total 

At        <  3^ 
A2        3^-5^ 
Ao       >  55 


k3 


2,100 

900 

3,000 

^,200 

1,800 

6,000 

700 

300 

1,000 

7,000     3,000      10,000 


Joint  Probabillty-The  probability  that  an  employee  selected 
at  random  is  of  a  specified  sex  and  age  is  a  joint  prob- 
ability because  two  characteristics  of  the  random  selection 

have  been  specified.  The  probability  of  selecting  a  male 

7,000 
worker  at  random  is  P(B^)  =  ..  QQQ  «  .7.   The  probability 

of  selecting  a  male  worker  under  3^  years  old  is  the  joint 

probability  PU^)  =  -i^££_  =  .21 

Conditional  Probabllity-The  probability  of  selecting  an 
employee  less  than  3^  years  old  when  it  is  known  that  he  is 
male  is  a  conditional  probability  because  one  event  is 
already  known,  namely,  the  condition  that  he  is  male. 


Then,  P(A1|b1)  =  — \ 

7 » 


2,100 


000 


=  .3 
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Marginal  Probabllity-The  marginal  probabilities  appear  in 
the  summation  column  and  row  of  the  above  table,  thus  the 
term  "marginal".  The  marginal  distribution  of  sex  of  employees 
is:  male  ■  .7  and  female  =  .3.  The  marginal  distribution 
by  age  groups  are:<3^  =  .3,  3^-54  =  .6  and  -  55   =  •!•   Note 
that  the  sum  of  each  margin  is  1. 

The  above  explanation  may  be  made  clearer  by  restating 
Table  8.1  in  the  usual  probability  function: 

Table  8.2  Frequency  Distribution  of  Employees  by  Age  and 


Sex 

Age 

Male 

Bl 

Female 

5~ 

Total 

Al 

<  3^ 

..21 

.09 

.30 

A2 

3^-5^ 

.42 

.18 

.60 

A3 

>  55 

.07 
.70 

.03 
.30 

.10 
1.00\ 

If  we  want  to  know  the  probability  that  a  person 
selected  at  random  is  either  male  or  under  3^  years  old,  we 
can  add  the  probabilities  of  each,  but  we  must  be  careful 
not  to  add  the  same  group  twice.   Thus  the  probability 
that  a  person  is  male  is  .7  and  the  probability  that  a 
person  is  under  3^  years  old  Is  .3  but  we  have  counted  the 
male  under  Jk   twice.  Therefore  the  answer  is  .7  +  .3  -  .21  =  .79 
The  answer  could  also  have  been  obtained  in  several  other 
ways,  e.g.;  the  probability  that  a  person  is  male  is  .7, 
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but  to  Include  all  the  people  under  3^  we  must  also  in- 
clude females,  .7  +  .09  =  .79. 
Try  similar  reasoning  for  .3  +  .42  +  .07  -   .79 
Problems 

1«   If  70$  of  collisions  at  sea  occur  during  reduced 

visibility  and  60%   occur  in  the  approaches  to  major 

world  harbors, 

a)  What  per  cent  occur  in  the  open  sea  in  good 
visibility? 

b)  In  reduced  visibility  in  the  open  sea? 
Answers:  a)  12$,  b)  28^ 

2.   In  the  above  problem  given  that  a  collision  occured 
in  the  open  sea,  what  is  the  probability  that 
visibility  was  good?  Answer:  .3 

3«   In  the  destroyer  force  a  ship  can  earn  a  permanent 
gold  "E"  if  she  wins  the  Battle  Efficiency  Award 
five  years  in  a  row.   What  is  the  probability  that 
a  relatively  new  ship  which  won  no  awards  her  first 
two  years  in  commission  then  won  three  consecutive 
awards  can  win  the  gold  "E"?  Assume  that  there  are 
eight  ships  in  the  squadron  and  (although  unrealistic) 
they  are  evenly  matched  and  therefore,  have  equal 
opportunity  to  win. 
Answer:  .0156 

4.   If  20^  of  the  men  in  the  Navy  are  not  married  and 

70$  of  those  who  are  have  two  or  more  children  while 
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30%   have  three  or  more,  what  per  cent  are  married 
with  one  or  no  children? 
Answer :  2h% 
5»   Seventy  per  cent  of  the  time  wholesale  and  retail 
prices  move  in  the  same  direction.  Sixty  per  cent 
of  the  time  when  they  move  together  they  are  moving 
upward,  and  seventy  five  per  cent  of  the  time  when 
they  move  upward  together  a  prosperous  period 
follows;  twenty  five  per  cent  of  the  time  bad 
times  follow.   There  are  as  many  prosperous  periods 
as  bad  periods.   When  prices  move  in  opposite 
directions,  the  odds  are  even  that  the  following 
period  will  be  prosperous  or  bad.   What  Is  the 
probability  that  there  will  be  a  downward  movement 
in  both  prices  followed  by  a  prosperous  period? 
Answer:  .035 
■  Hint:  Try  to  get  the  answer  by  filling  in  the  rest 
of  this  table : 


Bad  Times 


Same 

.70 

Opposite 

up 

down 

.50 

1  Perlman,  Richard,  Problems  in  Statistics  for  ftoonomics  and 
Business  Students.  New  York:  Holt  Rinehart  and  Winston," 
Inc.,  1963.  P.  27 
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6#  Three  destroyers  in  a  ASW  exercise  have  a  past 
record  of  making  contact  20,  30,  and  ^0  per  cent 
of  the  time,  respectively.  Assuming  that  there  is 
a  different  submarine  in  the  vicinity  of  each 
destroyer,  (a)  what  is  the  probability  that  all 
make  contact?  (b)  that  at  least  two  will  make  contact? 
(o)  that  none  make  contact? 
Answers:  a)  .02^,  b)  .212,  c)  .336 

7«  A  cruiser  is  attacked  by  four  airplanes  in  succession. 
If  her  past  performance  has  shown  an  ability  to  shoot 
down  k0%   of  attacking  aircraft,  (a)  what  is  the 
probability  of  shooting  down  exactly  one  plane? 
(b)  of  shooting  down  all  four?  (c)  of  shooting  down 
at  least  two? 
Answers:  a)  .3^56,  b)  .0256,  c)  .52^8 

8,  A  dealer  holds  2  aces  and  k   dueces  in  his  hand. 

1  He  offers  to  form  a  pool  of  $60  to  be  paid  to  the 
first  man  among  five  to  draw  an  ace.   If  A,  B,  C,  D, 
and  E  are  going  to  draw  in  that  order,  how  much  should 
each  pay  to  play?  A  card  once  drawn  is  not  replaced. 
Answers:  A-$20,  B-$l6,  C-$12,  D-$8,  E-$*f 

9.  Comment  on  the  following  quotation: 

•  •  •  .Nothing  •  •  .is  more  difficult  than  to  con- 
vince the  merely  general  reader  that  the  fact  of 
sixes  having  been  thrown  twice  in  succession  by  a 
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player  at  dice,  is  sufficient  cause  for  betting 
the  largest  odds  that  sixes  will  not  be  thrown  in 
the  third  attempt.   A  suggestion  to  this  effect  is 
usually  rejected  by  the  intellect  at  once.   It  does 
not  appear  that  the  two  throws  which  have  been  completed, 
and  which  lie  now  absolutely  in  the  Past,  can  have 
influence  upon  the  throw  which  exists  in  the  Future. 
The  chance  for  throwing  sixes  seems  to  be  precisely 
as  it  was  at  any  ordinary  time  .  .  .  .And  this  is  a 
reflection  which  arrears  30  exceedingly  obvious  that 
attempts  to  controvert  it  are  receivad  more  frequently 
with  a  derisive  smile  than  with  anything  like  respect- 
ful attention.   The  error  here  involved — a  gross  error 
redolent  of  mischief — I  cannot  pretend  to  expose  within 

the  limits  assigned  me  at  present;  and  with  the 

1 
philosophical  it  needs  no  exposure   


I 

Edgar  Allan  Poe,  "The  Mystery  of  Marie  Roget",  final 

paragraph  in  Murders  in  the  Rue  Morgue. 
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CHAPTER  9 
The  Binoir/  -1  .Distribution 
9. 1   Introduction. 

In  chapter  3»  a  binomial  distribution  was  developed 
using  a  ten-sided  die  as  a  model  and  if  you  worked  the 
first  problem  at  the  >snd  of  that  chapter,  you  developed 
one  yourself.   The  binomial  probability  is  written  in 
Schlaifer  as  P(r)  =  c£  pr  qn~r.   It  is  an  expression  for 
any  term  of  the  binomial  expansion.   For  example,  let's 
determine  the  fourth  term  of  an  expansion  of  the  form 
(p  +  q)'.   It  can  be  expressed  P(4)  -  Ci,  p  q^.  By  perform- 
ing  the  indicated  opor^tion  we  have 
P(4)  „  T_^_  p^  q3 


P(4)  =  35  P^  q3 

By  the  method  used  in  chapter  3,  the  4th  term  coef- 
ficient from  Pascal's  triangle  is  35  and  the  binomial  is 
p  q^,  so  again  the  4th  term  is  35  P  q  . 

Any  binomial  distribution  can  be  developed  in  the 
same  manner.   It  is  important  to  remember  the  relation- 
ship between  p  and  q.   The  assignment  of  a  value  to  either 
p  or  q  implicitly  determines  the  other  since  p  is  the  1's 
complement  of  q  and  vice  versa,  i.e.,  p  =  1  -  q.   With 
this  in  mind  it  is  easy  to  see  that  there  are  two  parameters 
that  determine  a  binomial  distribution,  n  and  p.   In  the 
dice  model,  p  is  the  probability  that  is  assigned  to  a 
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particular  face  coming  up  on  the  die,  q  is  the  probability 
of  the  same  face  not  coming  up,  n  is  the  number  of  throws 
or  trials  in  a  series,  and  r  is  the  number  of  times  that 
particular  face  comes  up.   It  is  sometime?  more  convenient 
to  think  of  r  in  terms  of  the  number  of  successes.   Thinking 
of  the  die  model  is  one  way  of  keeping  the  parameters  of  the 
binomial  distribution  in  perspective. 

In  the  next  section  it  will  be  shown  how  the  binomial 
distribution  is  developed  and  conveniently  structured  into 
tables  thus  making  it  unnecessary  to  calculate  each  dis- 
tribution. 
9.2  Derivation  of  a  specific  Cumulative  Binomial  Distribution. 

A  key  to  the  use  and  application  of  Table  I  in  Schlalfer, 
is  an  understanding  of  how  the  table  values  -;ere  derived, 
and  the  organization  oT   the  data.   To  illustrate  a  der- 
ivation of  part  of  the  tables,  let's  derive 
P(r  ^  r|n=9,  p=.^0). 

In  the  left  hand  column  of  Table  9-1  is  the  binomial 
distribution  of  P(r  ^  rln=9,  p=.i0  reproduced  from  Table  I 
in  Schlalfer.   To  the  right  of  the  vertical  line  in  Table 
9-1,  are  the  calculations  which  derive  this  same  distri- 
bution to  nine  places.   These  calculations  were  made  of 
this  particular  distribution  because  the  probabilities  are 
finite  in  nine  places  and  it  again  emphatically  demonstrates 
that  the  sum  of  the  probabilities  of  all  events  in  a  distri- 
bution equal  1.000000000. 
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Table  I  in  Schlalfer  rounds  off  the  probabilities  to 
the  4th  place.   By  rounding  the  right  hand  column  in  Table 
9-1  to  four  places,  the  calculated  quantities  agree  with 
Schlaifer.   Notice  that  Table  I  in  Schlaifer  provides 
distributions  for  parameter  n  of  values  1  through  20,  50, 
and  100,  and  for  parameter  p,  of  values  .01  to  .50  inclus- 
ive.  The  tables  for  values  of  p  from  .51  to  .99  can  easily 
be  derived  from  Table  I.   The  next  section  will  show  how 
this  can  be  done. 

9.3  Obtaining  Cumulative  Binomial  Probabilities  from  Table 
I  in  Schlaifer  when  p  ^  . 50. 

Examine  the  formula  by  which  the  binomial  distribution 
was  developed  in  Table  9-1 J 

P(r  =  r|n=9,  p=.40)  =  C9  .40r  (1.00  -  .40)9~r 


C9  .40r  (.60)9~r 


T 


For  parameters  n  =  9»  P  =  .60,  the  formula  is 
P(r  =  r|n=9,  p=.60)  =  C9  .60r  (1.00  -  .40)9~r. 

=  C9  .60r  (.4C)9^ 

=  c9  .4o9~r  (.6o)r 

By  comparing  formulas  (l)  and  (5),  one  can  see  that 

a)  the  coefficients  are  the  same, 

b)  the  quantities  calculated  from  (2)  are  also 
calculated  from  (5)  except  in  the  inverse  order, 

c)  and  it  follows  that 


(1) 

(2) 

(3) 

(4) 

(5) 
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of  1,  2,  3,  and  k   =  P(r  ^  5(n=9,  P=.4), 

c)  in  the  3rd  quadrant,  the  sum  of  the  probabilities 
of  9,  8,  7,  -6,  and  5  =  ?(r  *  5|n=9,  P=.6), 

d)  in  the  4th  quadrant,  the  sum  of  the  probabilities 
of  4,  3,  2,  1,  and  0  =  P(r  *   5ln=9,  p=.6). 

The  sum  of  the  probabilities  in  the  1st  and  4th 
quadrants  are  equal,  and  the  sum  of  the  probabilities 
in  the  2nd  and  3rd  quadrants  are  equal.   Table  I  in 
Schlaifer  gives  the  sum  of  the  probabilities  in  the  1st 
quadrant.   Since  we  know  that  the  sum  of  the  probabilities 
of  events  0  to  9  is  equal  to  1.0000,  then  it  follows  that 
the  sum  of  the  probabilities  ol"  .;...=  2r.^     rant  is  equal 
to  1.0000  minus  the  sum  of  the  probabilities  in  the  1st' 
quadrant.  Thus,  the  sum  of  the  probabilities  in  the  3rd 
and  4th  quadrants  are  defined. 
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P(r  =  0|n=9,  P=.^0) 
P(r  =  l|n=9,  p=.^0) 
P(r  *  2.|n=9,  p=.40) 
P(r  =  3|n=9,  p=.^0) 

•     •     •     6 wC  • 


P(r 
P(r 
P(r 


=  P(£  = 


9|n=9,  p=.60) 

9|n=9,  p=.60) 

9|n=9,  p=.60) 

9in=9,  p=?.60) 


A  convenient  way  to  illustrate  (c)  above  is  to  draw 
a  pictorial  diagram  as  follows: 

Fig.  9-1 
"  A  Pictorial  Diagram. 


1st  quadrant 


3rd  quadrant 


Jj-th  quadrant 


In  Fig.  9-1 »  the  discrete  "binomial  distribution  is 
plotted  to  scale  for  the  sake  of  explanation.  From  the 
diagram,  it  can  be  seen  that 

a)  in  the  1st  quadrant,  the  sum  of  the  probabilities 
for  5,  6,  7,  8,  and  9  -  P(r  ^  5|n=9,  P=.*K>), 

b)  in  the  2nd  quadrant,  the  sum  of  the  probabilities 


9-6 


Problems: 

9-1.   You  are  the  commanding  officer  of  a  destroyer  that 
is  undergoing  refresher  training  in  San  Diego.   The  ship 
is  required  to  shoot  an  anti-aircraft  gunnery  exercise 
using  VT  non-frag  ammunition  which  if  not  triggered  by  the 
target,  is  designed  to  self -destroy  before  water  entry. 
In  two  practice  exercise  firings  of  100  rounds  per  exercise, 
the  ship  has  scored  unsatisfactorily.   The  gunnery  officer 
swears  that  the  battery  is  properly  aligned  and  that  the 
cause  of  the  unsatisfactory  scores  must  be  the  ammunition. 
You  know  that  the  lots  are  supposed  to  be  no  more  than  b 
per  cent  defective.   Your  observers  report  that  7  rounds 
out  of  the  first  IOC  did  not  self -destroy  and  that  9  rounds 
out  of  the  second  100  did  not  self -destroy.   Assuming  a 
p  =  ,0k,   what  probabilities  would  you  assign  to  these  two 
events?  What  could  you  hypothesize  about  the  per  cent 
defective? 

9-2.   A  navy  supply  center  has  100  excess  typewriters  in 
various  stages  of  repair.  A  notice  is  sent  out  to  all  ships 
in  port  that  these  typewriters  will  be  issued  on  a  first 
come,  first  served,  no  cost  basis  with  a  limit  of  3  to 
destroyers;  9  to  DLG's  and  15  to  cruisers.   Some  of  these 
typewriters  are  beyond  economical  repair.   No  choice  among 
the  typewriters  will  be  permitted.  Assuming  that  all  ships 
draitf  their  limit: 
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CHAPTER  10 
Statistical  Decisions  Rules  and  Their  Error  Characteristics 
10.1  Terms 

Decision  makers  must  often  make  a  choice  between  two 
alternatives  such  as-whether  to  publish  or  not;  whether  to 
market  a  new  product  or  not;  whether  a  new  development  (such 
as  a  drug)  Is  effective  or  not;  whether  an  Industrial  (or 
production)  process  Is  satisfactory  or  not;  and  In  the  social 
sciences  whether  certain  data  should  be  "believed"  or  not. 
Making  a  choice  on  matters  of  this  type  can  be  done  by 
statistical  analysis  of  random  sampling,  which  leads  to  some 
new  terms — Null  Hypothesis,  Alternate  Hypothesis,  -ype  I 
Error,  Type  II  Error,  Producer's  Risk,  Consumer's  Risk  and 
Tests  of  Significance. 
10.2  The  Null  Hypothesis 

The  statistical  hypothesis  approach,  in  attempting  to 
make  a  decision  or  reach  an  opinion,  is  to  make  an  assumption 
for  the  sole  purpose  of  either  accepting  it  or  rejecting  it. 
For  example,  we  make  an  assumption  that  a  coin  is  fair:  that 
is,  the  probability  of  getting  heads  on  n  tosses  is  p  ■  .5# 
This  we  might  call  the  null  hypothesis.   If  we  toss  the  coin 
100  times  and  get  60  heads  we  might  then  reject  the  null 
hypothesis  and  accept  the  alternate,  that  the  coin  is  not 
fair.  Generally,  the  null  hypothesis  may  be  thought  of  as 
the  status  quo  and  leads  to  no  action.  However,  this  is  not 
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necessarily  true,  for  the  hypothesis  can  be  stated  in  a  number 
of  ways.  The  alternate  hypothesis  is  the  opposite  of  the  null 
and  leads  to  an  opposite  course  of  action. 
10,3  Type  I  and  Type  II  Error 

Suppose  that  as  a  result  of  random  sampling  we  are  lead 
to  a  certain  course  of  action.   But  what  if  the  sampling, 
even  though  properly  done,  has  produced  untrue  information? 
If  we  acted  on  this  untrue  information  we  have  made  an  error. 
Returning  to  the  example  of  the  coin  with  a  null  hypothesis 
that  it  was  fair  but  subsequent  acceptance  of  the  alternate 
based  on  a  sample  of  100  tosses.   Suppose  that  we  toss  the 
same  coin  an  additional  1000  times  obtaining  510  heads  and 
^90  tails,  and  further  that  by  a  scientifically  accurate 
balancing  test  we  find  that  the  coin  was  in  fact  fair.   Then, 
having  previously  rejected  the  null  when  in  fact  it  was  true, 
we  have  made  an  error.   This  is  called  a  Type  I  error.   A 
Type  II  error  is  Just  the  opposite-to  accept  the  null  hy- 
pothesis when  it  is  false -example:  toss  a  coin  100  times  with 
k&   heads  and  52  tails  leading  to  the  null  hypothesis  that 
the  coin  is  fair  but  subsequent  tosses  with  ^20  heads  and 
580  tails  out  of  a  thousand  indicate  that  the  coin  is  in 
fact  not  fair;  then  an  error,  Type  II,  has  been  made  in 
accepting  the  null  hypothesis  when  it  was  false. 
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This  terminology  may  be  illustrated  by  the  following  table: 

Table  10.1 


Reality 

Decision 

Accept  the  Null 

Accept  the  Alt, 

Null  hyp.  is  true 

No  Error 

Type  I  Error 

Alt.  hyp.  is  true 

Type  II  Error 

No  Error 

10.4  Producer's  and  Consumer's  Risk 

When  testing  any  hypothesis  by  sampling  techniques; 
the  decision  rule,  in  order  to  be  useful,  must  minimize  the 
probability  of  error.  The  difficulty  here  is  that  minimiz- 
ing one  type  of  error  is  usually  accompanied  by  an  increase 
in  the  other.  For  example,  a  drug  company  desires  to  test 
a  new  remedy  on  a  sample  of  100  patients  in  a  hospital.  A 
substantial  amount  of  data  over  a  period  of  years  has  lead 
to  the  conclusion  that  the  old  remedy  is  50%   effective. 
Therefore,  if  the  new  remedy  results  in  $\%   or  more  recover- 
ies, it  is  better  than  the  old  and  should  be  adopted.  We 
set  the  null  hypothesis  that  the  new  remedy  is  no  more 
effective  than  the  old  and  therefore  the  true  proportion  of 
recoveries  will  be  .5  or  less.   If  in  the  test  we  get  51 
recoveries,  we  might  assume  that  the  null  hypothesis  is 
false  and  that  the  new  remedy  is  better  than  the  old.   It 
is  intuitively  obvious  that  this  improvement  from  50  to  51 
percent  might  very  well  have  been  by  chance  alone  and  there - 
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fore  we  might  make  a  Type  I  error  by  rejecting  the  null 
hypothesis  when  In  fact  It  was  true*   So  In  order  to  reduce 
the  possibility  that  the  test  results  are  from  chance  alone, 
we  might  set  the  decision  rule  at,  say,  56  recoveries.   The 
probability  of  making  a  Type  I  error  Is  now  reduced:  that 
Is,  if  we  state  that  In  order  for  the  new  remedy  to  be  con- 
sidered better  than  the  old  there  must  be  a  $6%   recovery 
as  opposed  to  the  old  50% t   then  we  are  more  sure  of  our 
test  and  less  likely  to  make  a  Type  I  error.   However,  we 
have  now  Increased  the  probability  of  a  Type  II  error;  for, 
If  In  fact  the  new  remedy  Is  more  effective  than  the  old, 
we  have,  by  Increasing  the  level  of  the  test  It  must  meet, 
reduced  the  probability  of  finding  this  out.   By  this  de- 
cision rule  a  remedy  that  was  actually  5*$  effective  might 
very  well  be  rejected.   It  would  be  wrong  to  reject  the  new 
remedy,  and  by  so  doing  a  Type  II  error  would  be  made.   Thus 
it  may  be  seen  that  in  this  example,  as  in  many  others, 
especially  in  marketing  and  quality  control,  the  probability 
of  making  one  type  of  error  can  only  be  reduced  at  the  ex- 
pense of  increasing  the  probability  of  making  the  other  type 
of  error;  both  depend  on  the  true  value  of  the  unknown  p 
that  is  being  tested.   Errors  of  the  type  described  above 
are  also  sometimes  referred  to  as  producer's  and  consumer's 
risk;  the  producer's  risk  is  the  probability  that  his  improved 
product  will  be  rejeoted  and  the  consumer's  risk  is  the 
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probability  that  an  unimproved  product  will  be  accepted. 
Both  types  of  errors  can  be  reduced  by  taking  larger  samples 
and  the  nearer  the  sample  size  approaches  the  true  population 
size,  the  more  accurate  will  the  decision  rule  prove  to  be. 
But  the  larger  the  sample  size,  the  more  expensive  and 
time  consuming  the  testing  process, 

10.5  Significance  Level 

Realizing  that  it  might  be  unduly  arbitrary  to  set  a 
decision  rule  too  rigorously,  we  might  want  to  restate  It 
by  setting  a  maximum  probability  by  which  we  would  be  willing 
to  risk  an  error  in  our  hypothesis;  that  is,  we  might  be 
willing  to  accept  a  $%   probability  of  error;  then  we  would 
be  95#  confident  that  we  have  made  the  right  decision.   The 
maximum  probability  with  which  we  would  be  willing  to  accept 
a  Type  I  error  is  called  the  significance  level  and  is  de- 
noted by  o<( Producer's  risk).   The  maximum  probability  with 
which  we  would  be  willing  to  accept  a  Type  II  error  is  de- 
noted by  B    (consumer's  risk).   Significance  levels  are 
most  often  set  at  .05  and  .01,  but  other  significance 
levels  can  be  used  and  often  are,  particularly  in  medical 
and  psychiatric  statistics. 

10.6  Decision  Rules 

Business  problems  of  the  type  discussed  in  the  basic 
text  can  be  solved  by  first  computing  a  breakeven  point  and 
stating  the  null  hypothesis  in  terms  of  the  breakeven  point 
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and  then  setting  up  an  opportunity  loss  table.  Next  a 
decision  is  made  on  Just  how  large  a  risk  the  business  (or 
producer)  can  take  and  how  large  the  consumer  can  take.   If 
it  is  desired  to  balance  the  two  types  of  risks,  the  former 
X     ,  and  the  latter,  A   ,  are  set  equal  to  each  other.  With 
this  information  and  a  set  of  binomial  tables  an  (n,c) 
decision  rule  can  then  be  found  where  n  is  the  sample  size 
and  c  is  the  critical  (rejection  or  acceptance)  number.   If 
the  two  types  of  risks  are  not  balanced,  then  the  business- 
man (or  producer)  must  decide  the  greatest  chance  of  risk, 
QC    ,  that  he  is  willing  to  take  and  then  choose  an  (n,c) 
decision  rule  that  meets  this  criterion  while  at  the  same 
time  reducing  the  probability  of  Type  II  error  as  much  as 
possible. 

Problems 

1.  At  a  certain  critical  point  in  a  production 

process  a  component  must  be  tested  at  the  man- 

ufacturerS  expense.   If  the  manufacturer  will 

accept  a  risk  of  5%   of  having  what  he  considers 

a  good  lot  (that  is,  6%   or  less  defective) 

rejected  what  rejection  number  should  he  choose 

from  a  sample  of  50? 

Solution:  given i   n  ■  50,  &   =  »05   ,  p  =  .06 

from  the  binomial  tables:  R(6)  -  .0776 

R(7)  -..0289 
a  rejection  number  of  7  would  limit  the 
manufacturers  risk  to  2.89#  but  6  would  limit 
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it  to  7,76%,      Since  the  manufacturer  does 
not  want  his  risk  to  be  any  higher  than  5%t 
he  must  choose  the  sampling  plan  (7»50)» 

2.  Promotion  material  on  a  new  drug  claims  that 
it  is  90#  effective. 

a)  If  this  is  true,  what  is  the  probability 
that  in  a  sample  of  100  there  would  be  11 

,  non-effectives? 
Answer:  ,kl68 

b)  If  the  Null  Hypothesis  is  that  the  new  drug 
is  no  more  effective  than  the  old,  how 
many  non-effectives  would  have  to  turn  up 
in  the  sample  for  it  to  be  statistically 
significant  at  the  .05  level? 

Answer:  16 

c)  If  10  non-effectives  were  found  in  a 
sample  what  is  its  statistical  significance? 

3.  In  a  Bernoulli  process  of  sample  size  20  and 
defectives   p  ■  .25,  find  the  following: 

a)  The  probability  of  6  or  more  defectives. 
Answer:  .3828 

b)  The  probability  of  6  or  less  defectives. 
Answer:   .7858 

c)  The  probability  of  exactly  6  defectives. 
Answer:   .1686 
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d)  The  probability  of  5  to  8  defectives 
inclusive. 

Answer  i  .5443 

e)  The  probability  of  14  or  less  non-defectives. 
Answer:  .3828 

f )  The  probability  of  1*+  or  more  non-defectives. 
Answer:  .7858 

g)  The  expected  number  of  defectives. 
Answer :  5 

4.   A  contract  has  been  negotiated  between  a  consumer 
and  a  manufacturer  for  1,000  spare  parts  at  $5 
each. 

a)  If  the  consumer  considers  it  a  serious  risk 
to  accept  any  lot  in  which  50  or  more  are 
defective  and  the  manufacturer  considers  it 
a  serious  risk  to  reject  any  lot  in  which  30 
or  less  are  defective,  what  rejection  number 
in  a  sampling  plan  of  100  should  they  agree 
on  in  order  to  balance  their  risks? 
olutlon:  n  «  100,  p1  =  .03  p2  =  .05,  cC  = fr  =  ? 

_£ /-/3 

rejection  c  =  3,   .5802      .1183 

4,  .3528      .2578 

5,  .1821      .4360 
Their  risks  are  most  closely  balanced  at 
rejection  o  =  4. 
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b)  If,  instead  of  balancing  the  risks,  the 
consumer  agrees  that  he  would  be  willing 
to  spend  an  additional  $400  but  no  more  on 
replacing  defective  parts  previously  accept- 
ed, what  sampling  plan  should  be  used  assum- 
ing that  he  wants  to  limit  the  risk  of 
this  $400  additional  expenditure  to  .10? 
Solution:   n  -  100,  p  =  **00 =  .08,/$  *  .10 

5000      / 

Rejection  c  ■  5,   1r 

1-.9097  =.0903 

6,   1-.8201  «  .1799 
The  rejection  number  should  therefore 

be  c  =  5 
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CHAPTER  11 
Evaluation  of  Statistical  Decision  Rules  in  Terms  of 

Expected  Loss 

1 
11.1  The  Terminal  decision  without  sampling. 

Suppose  a  manufacturer  is  engaged  in  the  manufacture 
of  electron  tubes  for  the  U.  S.  Navy.   Type  18  C  355  must 
be  built  to  very  close  tolerances  to  have  them  perform 
satisfactorily.   They  require  a  coating  of  radioactive 
cobalt,  grade  A  or  grade  B.   Grade  A  cobalt  is  consider- 
ably more  expensive  than  grade  B.   The  tubes  manufactured 
with  grade  A  cobalt  have  80  defectives  per  batch  of  1000 
tubes.   Tubes  manufactured  with  grade  3  cobalt  are  much 
more  unpredictable.   Previous  experience  has  shown  that 
the  run  characteristic  using  grade  3  cobalt  has  produced 
the  following  data  which  is  considered  valid. 

Table  11-1 


P(p) 


No.  of  defectives  ner  1000 


.12  defective  .12 

.20  defective  .20 

.35  defective  .35 

.45  defective  .30 

.55   defective  .03 
1.00 


120 
200 

350 
k50 
550 


Bierman,  Fouraker,  and  Jaedicke.  Quantitative  Analysis 
for  Business  Decisions.  Pp.  161-171. 
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The  tubes  cost  $80  apiece  to  manufacture  using  grade 
A  cobalt.   If  grade  B  cobalt  is  used,  the  tubes  cost  $50 
apiece.   The'  additional  manufacturing  cost  is  caused  by  the 
difference  in  cost  of  the  two  grades  of  radioactive  cobalt. 
Because  of  the  rapid  deterioration  of  the  catalytic  fusion 
agent,  the  radioactive  cobalt  is  prepared  in  batches  just 
large  enough  to  fulfill  the  needs  of  each  1000  tube  run. 
The  cost  of  reworkine;  a  defective  tube  is  $100. 

If  tubes  come  out  on  a  run  having  a  run  characteristic 
of  .12,  .20,  or  .35,  the  manufacturer  would  want  to  use 
grade  B  radioactive  cobalt  because  the  overall  costs  are 
less  than  if  he  use;  grade  A  cor  ..-.  .   If  the  tubes  came  out 
on  a  run  having  a  run  characteristic  of   -5  or  ,55i    the 
manufactur.:.-  would  want  to  use  grade  A  cobalt  because  the 
overall  costs  are  less  than  if  he  used  grade  B  cobalt. 
11.2  Computation  of  the  breakeven  fraction 

a)  Grade  B  cobalt  cost  per  run: 

Cost  to  rework  =  1000  o  x  ftlOO  =  8100,000  p   (1) 

b)  Grade  A  cobalt  cost  per  ru  : 

Cost  to  rework  =  1000  x  .C     ;100  =  $8,000   (2) 

Additional 

cost  of  Grade  A=  1000  x  $30        =  30,000 

Total    $38,000 

The  point  at  which  the  manufacturer  would  be  indifferent 

as  to  which  grade  of  cobalt  to  use  is  found  by  equating  (1) 

and  ( 2 )  above : 
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Table  11-2* 
Expected  costs  of  immediate  Terminal  Decision 


ACT 

Use 

Grade  A 

Use 

Grade  B 

p 

P(p) 

Cond. 
Cost 

Expected 
Cost 

Cond. 
Cost 

Expected 
Cost 

.12 

.08 

038,000 

$3,0*1*0 

$12,000 

•  960 

.20 

.15 

38,000 

5,?oo 

20,000 

3,000 

.35 

.27 

38,000 

10,260 

35,000 

9,^50 

•^ 

.35 

38,000 

1*3  *3  r\  r\ 

•^5,000 

15,750 

.55 

.15 

•3£5COO 

3,  /Oo 

55,000 

8,250 

l.oo  i38,ooo         337, 2ao 


*Note:   The  fixed  costs  of  #50,000  for  either  act  is 
omitted  for  simplicity. 
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#100,000  pb  =  138,000 

Pb  =     .33 
(Note:  p  irTreality  could  be  other  values  than  those 
listed  in  Table  11-1,  however,  in  this  example,  they 
are  not  considered  possible)  , 

The  manufacturer  now  is  faced  with  a  decision  of 
whether  he  should  sample  or  not  sample  the  tubes  coming 
off  the  production  line  on  the  first  part  of  each  1CCC 
tube  run.   Secondly,  if  he  does  decide  to  sample,  what 
decision  rule  should  he  use?   If  the  decision  is  not  to 
sample,  then  Table  11-2  summarizes  the  Expected  Cost  of 
the  two  acts,  i.e..,  1)  use  grade  A  cobalt  or,  2)  use  grade 
B  cobalt.   Oi'ivi  can  see  that  be  make  a  terminal  decision  at 
this  point  he  should  use  gra^  5  cobalt  because  the  expected 
cost  is  0590  cheaper  than  to  use  grade  A  cobalt.   So  a 
decision  must  be  whether  to  act  now  on  the  basis  of  this 
information  or  to  delay  action  in  order  to  gather  further 
information  by  sampling.   The  manufacturer,  for  example, 
could  start  using  grade  B  cobr.lt  and  take  a  sample  of 
tubes  from  the  production  line.   Based  on  the  number  of 
defectives  found  in  the  sample,  he  would  decide  whether  to 
continue  this  run  with  grade  B  cobalt  or  shift  to  grade  A 
cobalt.   Table  11-3  computes  the  Expected  Cost  under 
Certainty: 
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Table 

11-3 

Expected  Cost 

under  Certainty 

p 

P(p) 

Best  Cond. 
Cost  Given 
each 
event 

BEST  ACT 

Expected 

Cost 

.12 

.08 

$12,000 

Grade  B 

0   960 

i20 

.15 

20,000 

Grade  B 

3,000 

.35 

.27 

35,ooo 

Grade  B 

9,^50 

M 

.35 

38,000 

Grade  A 

13,300 

.55 

.15 
1.00 

38,000 

Grade  A 

5,700 

^32,^10 

The  cost  of  uncertainty  is  337,2KLO-  |32,^10  =  ;5,000  and 
this  sets  the  Expected  Value  of  Perfect  Information  to 
the  manufacturer. 

If  the  cost  of  sampling  is  ;100  per  tube,  it  follows 
that  the  economic  lir.it  of  n,  size  of  the  sarrole ,  is 
#5,000/100  =  50.   T; ■]  .ng  a  sample  larger  than  50  would 
cost  more  than  taking  the  lots  as  they  came  and  using 
the  best  z    z,    assuming  that  he  r  :_:..ehow  were  clairvoyant 
enough  to  determine  the  characteristic  of  each  lot  before 
processing. 

The  next  thing  that  must  be  done  is  to  choose  the 
best  decision  rule,  i.e.,  the  best  n  (sample  size)  and 
the  best  c  (criterion  number). 
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Table  11-4 
Conditional  costs 

ACT 


Use  Use 

GRADE  A GRADE  B, 


.12  §38,000        $12,000* 

.20  33,000         20,000* 

.35  38,000       35,000* 

.45  38,000*       45,000 

.55  38,000*      55,000 

The  conditional  cost  of  each  act  is  set  forth  in  Table 

11-4.   The  opportunity  losses  for  these  acts  are  set  forth 

in  Table  11-5.   If  the  manufacturer  wants  to  shift  to  Grade 

A  cobalt  when  the  process  characteristic  is  .12,  it  will 

cost  him  g 26, 000  more.   If  he  keeps  using  Grade  3  when  the 

process  characteristic  is  *55,    it  will  co;«  him  $17,000 

more  than  had  he  shifted  to  Grade  A  cobalt,  etc. 

Table  11-5 

Conditional  Opportunity  Losses 


GRADE  A 

GRADE  R 

Column  6 

.12 

$26,000 

(    o 

X 

.08 

= 

$2,080.00 

.20 

18,000 

0 

X 

.15 

= 

2,700.00 

.35 

3,000 

0 

X 

.27 

= 

810.00 

.45 

0 

7,000 

X 

.35 

= 

2,450.00 

.55 

0 

17,000 

X 

.15 

= 

2,550.00 
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Table  11-5  sets  forth  the  opportunity  losses  if  the 
wrong  act  is  elected  for  fractions  above  and  below  the 
breakeven  value,  P^  =  .33. 

Figures  11-1  and  11-2  are  selected  error  characteristic 
curves  plotted  for  sample  sizes  5,  10,  12,  15,  17,  20,  50, 
and  100  with  various  rejection  numbers,  and  a  p  from  0  to 
.60.   These  curves  are  plotted  directly  from  Table  1  in 
Schlaifer.   It  is  possible  to  read  the  probability  of  wrong 
decision  directly  from  these  curves.   To  read  the  probabil- 
ities of  wrong  decision  if  the  true  p  is  less  than  the  break- 
even value,  read  the  probabilities  from  the  left-hand  margin 
of  the  figure;  if  the  true  p  is  greater  than  the  breakeven 
value  then  read  the  probability  of  wrong  decision:  from  the 
right-hand  margin.   For  example,  in  the  electron  tube  problem, 
for  sample  size  20,  the  probability  of  a  wrong  decision 
using  a  rejection  number  of  10  for  a  true  p  of  .35  is  P(wd)  =  .12. 
(It  is  read  from  the  left-hand  margin  because  .35  is  less  than 
pb  =  »38).   What  we  are  saying  is,  that  in  a  sample  size  of 
20  tubes,  if  10  or  more  are  defective,  the  manufacturer  in 
using  the  (20.10)  decision  rule,  has  a  probability  of  .12  of 
making  the  wrong  decision  when  p  =  .35,  i.e.,  based  on  the 
sample  he  assumes  that  p  of  the  batch  is  greater  than  .38 
when  it  actually  is  .35,  and  rejects  the  null  hypothesis  p  ^  .38, 
and  instead  of  using  Grade  E  cobalt  he  shifts  to  Grade  A  when 
he  should  have  continued  with  Grade  B;  he  would  have  made  a 
TYPE  I  error  (rejects  the  null  hypothesis  when  true).   To 
continue  the  illustration  of  decision  rule  (20,10),  if  the 
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true  p  Is  .45,  the  P(wd)  =  .60.   It  is  read  from  the  right- 
hand  margin  because  p  =  .45  is  greater  than  P}->  =  .38.   In 
this  instance,  we  are  saying  that  in  a  sample  of  20  tubes, 
if  less  than  10  are  defective,  the  manufacturer  in  using 
(20,10)  decision  rule,  has  a  probability  of  .60  of  making 
the  wrong  decision  when  p  =  .45,  i.e.,  based  on  the  sample 
he  accepts  the  null  hypothesis  that  p  of  the  batch  is  less 
than  .38  when  it  actually  is  .45  and  continues  using  grade 
B  cobalt  when  he  should  have  shifted  to  Grade  A;  he  would 
have  made  a  TYPE  II  error  (accepts  the  null  hypothesis  when 
false). 

11 . 3  Computation  cf  optimal  sarr.-ol-  ~- ze 
Decision  r^."  -:   .?C  7) . 

The  Tc":.l  Expected  Opportunity  Loss  will  be  the  sum  of 
the  Expected  Opportunity  Losses  for  each  fraction  defective. 
The  E0L  is  found  by  multiplying  the  Conditional  Opportunity 
Losses  by   the  joint  probability  of  P(p)  and  P(wd).   To  give 
us  more  of  a  clue  to  which  decision  rule  to  start  with,  we 
multiply  the  column  6  by  their  respective  P(p),  as  illustrat- 
ed in  Table  11-5,  column  6.   To  obtain  the  smallest  EOL ,  the 
P(wd)  must  be  as  small  as  possible  ">r  all  fractions,  but 
particularly  for  p  =  .12,  .20,  .45,  and  ,55*      Keeping  in  mind 
that  the  sample  size  must  be  less  than  55,   we  can  peruse 
Figures  11-1  and  11-2  for  a  reasonable  decision  rule  and  work 
from  there.   The  rule  (20,?)  has  been  selected  as  a  starting 
point  since  it  seems  to  best  fit  the  conditions  stated  above. 
The  calculations  are  illustrated  in  Table  11-6. 
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Table 

11-6 

Deci 

sion 

rul 

e  (20,7) 

(Slid 

e  rul 

e  calculations) 

p 

Column  6 

P  ( wd ) 

E0L 

.12 

$2,080 

X 

.0067 

= 

13. 

95 

.20 

2,700 

X 

.0867 

= 

234. 

00  . 

.35 

810 

X 

.5834 

= 

4?2, 

00 

M 

2,450 

X 

.1299 

= 

318, 

00 

.55 

2,550 

X 

.0214 

3 

5*. 

60 

•l 

,092, 

55 

C 

DSt  Of 

2 

,000, 

00 

S; 

arnpling 
UETL 

= 

;)3 

,092. 

55 

USTL 

=  Un< 
sion 

sond 

itional  Expected  Tota 

1  (opportunity)  Loss 

Deci 

Rul 

2  (19,7) 

r> 

Column  6 

P(-d) 

.12 

#2,080 

X 

•  Ou- 

= 

S? 

10, 

00 

.20 

2,700 

X 

.0676 

= 

182, 

30 

.35 

810 

X 

.5188 

= 

420. 

00 

M 

2,450 

X 

.1727 

-• 

42  3. 

00 

.55 

2,550 

X 

.0342 

= 

87. 

30 

$1 

,122, 

60 

Cost  of 

Sampling 
USTL 

= 

1 

,900. 

00 

13 

,022. 

,60 
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A  tabulation  of  decision  rules  are  given  in  Table  11-?. 
' Table  11-7 

20,?  ^3,092.55 

20,8  3,179.^2 

19.7  3,022.60 

19.8  3,233.68 

19.6  3,110.80 

18.7  3: 000. 58* 

18.6  353-<3  00 

17.7  3,04-0.78 


Based  on  the  calculations  which  derived  Table  11-7, 
it  is  concluded  that  the  best  decision  rule  is  (18,7), 
i.e.,  n  =  18  and  c  =  7.   The  opportunity  loss  theoretically 
will  be  reduced  by  $5,000  -  $3,000),  or  |2,000  per  run  of 
electron  tubes. 

Note  that  in  Table  11-5,  column  f-      the  sum  of  value 
of  column  6  for  p  =  ..12,  .20,  and  .35  is  the  unconditional 
expected  loss  for  the  Terminal  Act,  Use  frrade  A  cobalt,  and 
the  sum  of  the  values  of  column  6  for  p  =  A$   and  .55   is  the 
unconditional  excected  loss  for  the  Terminal  Act,  use  grade 
B  cobalt. 
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Problems 

2 
11.1    A,  manufacturer  is  faced  with  a  problem  of  deciding 

how  to  inspect  a  particular  part  as  it  comes  off  the 
assembly  line.   The  parts  can  either  be  good  or  defect- 
ive ,  and  the  manufacturer  can  choose  between  two  inspect; 
ion  systems.  Both  inspection  systems  will  give  perfect 
performance  on  good  parts;  that  is,  a  good  part  will 
never  be  classified  as  defective.   The  two  systems 
differ  on  classifying  defectives.   The  first  device, 
which  is  mechanical,  will  misclassify  at  a  rate  of  .01, 
.05,   10,  or  -.15,  depending  on  how  it  is  adjusted  at  the 
beginning  of  each  productic,  run.   Because  of  the  nature 
of  the  production  run,  once  the  ....chine  is  adjusted  it 
cannot  be  readjusted  until  the  next  run.   The  adjustment 
record  for  the  past  1,000  adjustments  is  as  follows: 
%   Defectives        Number  of 


•Cisclass 

ified 

Occurrences 

.01 

100 

.05 

300 

.10 

500 

.15 

100 

1,000  ■ 

The  alternative  inspection  system  is  to  place  a  workman 
on  the  assembly  line,  and  the  experience  in  the  past  hs 3 


ibid.  Problem  12-4,  p.  177.   This  problem  requires 
relatively  large  amount  of  time. 
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been  that  this  system  produces  only  .01  misclassif ications 

If  the  workman  is  used,  the  incremental  cost  over 
the  machine, will  be  $2$   per  production  run  (a  run  is  1,000 
pieces).   However,  when  a  defective  is  classified  as  good, 
there  is  a  cost  of  reworking  the  subassembly  when  the  part 
is  used.   This  cost  is  $.50  per  part  misclassif ied. 

a)  Under  what  conditions  would  the  inspection 
process  be  a  Bernoulli  process? 

b)  Assuming  the  conditions  in  (a)  are  fulfilled, 
calculate : 

1)  The  expected  monetary  value  of  the  :  .  st  act. 

2)  The  expected  cost  under  certainty. 

3)  The  expected  value  of  perfect  information. 

c)  Suppose  the  machine  could  be  adjusted,  a  sample 
of  up  to  50  parts  taken  to  check  the  adjustment, 
and  then  the  machine  could  continue  to  be  used 
or  a  workman  could  be  placed  en  the  assembly 
line  for  the  remainder  of  t:_   run. 

1)  Would  a  sample  of  10  sri--      be  worthwhile 
if  :  .  -ost  of  sampling  is  $.10  per  part? 
What  is  the  best  criterion  number  for  a 
sample  of  10? 

2)  Is  a  sample  of  20  better  than  a  sample  of  10? 

3)  What  is  the  maximum  sample  that  would  be 
taken,  based  on  expected  value  of  perfect 
information. 
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1.  What  are  the  probabilities  that  a  destroyer 
will  draw  exactly  1,  exactly  2  and  exactly 
3  reparables  under  the  condition  that  there 
are 

a)  10  non-reparables  in  the  lot? 

b)  35  non-reparables  in  the  lot? 

c)  68  non-reparables  in  the  lot? 

2.  What  are  the  probabilities  that  a  DLG  will 
draw  9  or  less,  6  or  more,  and  less  than  3 
reparables  under  conditions 

1.  (a),  (b),  and  (c)  above? 

3.  What  are  the  probabilities  that  a  CG  will 
draw  exactly  15  good,  8  or  more,  and  less 
than  three  reparables  under  conditions 

1.   (a),  (b),  and  (c)  above? 
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CHAPTER  12 
Revision  of  Probabilities  In  the  Light  of  New  Information 
12.1  The  basic  text  gives  an  easy  to  follow  explanation  of 
Baye's  theorem  with  figures  and  tables.   Another  approach 
Is  to  view  the  problem  In  the  same  manner  as  that  employed 
In  Chapter  8  on  marginal  probability.   Using  the  same 
Illustration  of  section  12.1  In  which  we  have  an  urn  con- 
taining 10  balls  In  the  following  mix: 

1.  i  are  striped;  of  these  .2  are  red  and  .8  are 
green 

2.  ^  are  dotted;  of  these  .6  are  red  and  .k   are 
green,  we  know  the  following: 

a)  10  balls,  5  are  striped  and  5  dotted 

b)  of  the  striped  ones,  1>(.2  x  5  ■  1)  Is  red 
and  ^,(.8  x  5  -  *0  are  green 

c)  of  the  dotted  ones  3, (.6  x  5  -  3)  are  red 
*       and  2, (.4  x  5  =  2)  are  green 

This  can  be  set  In  a  table  of  absolute  values: 

Table  12.1 

Red Green Total 

Striped . ,. 

Dotted 

? I 

Total 


l\lO 


and  easily  converted  to  a  table  of  marginal  probability: 
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Table 

12 

.2 

Red 

Green 

Total 

Striped 

.1 

.4 

.5 

Dotted 

.3 

.2 

.5 

Total 

.4 

.6 

\1.0 

1.0X 

The  four  values  in  the  center  of  the  table  are  the 
various  Joint  probabilities-red  striped,  red  dotted,  green 
striped  and  green  dotted-while  those  on  the  periphery  of 
the  table  are  the  unconditional  probabilities-red,  green, 
striped,  and  dotted. 

The  probability  of  drawing  any  one  ball  without  prior 
information  is  one  of  the  Joint  probabilities.   But  If  a 
ball  is  drawn  and  it  is  known  to  be  red,  then  the  probability 
that  it  is  striped  is  a  conditional  probability  and  is 
.1/.^  =  .25.   Given  that  a  ball  is  dotted,  the  probability 
that  it  is  green  would  be  .2/. 5  =  .**.  Etc. 

The  Joint  probabilities  of  the  above  table  can  be 
equated  to  the  original  probabilities  of  Figures  12.1  and 
12.2  of  the  basic  text.   The  revised  probabilities  of 
Figure  12.3  and  Table  12.2  are  obtained  in  a  manner  similar 
to  that  in  the  above  paragraph. 

The  above  illustration  has  practical  applications  in 
marketing  and  quality  control  because  it  reduces  the  cost 
of  uncertainty  which  depends  on  the  original  prior  prob- 
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abilities. 

12.2  Assume  that  a  manufacturer  is  turning  out  a  machined 

product  from  raw  material  and  that,  dependent  upon  the 

quality  of  raw  material,  he  has  historically  gotten  a 

1 
fraction  defective  as  follows: 

Table  12.3 

■ 
Fraction  defective,  p   Frequency  of  Occurrence  P(p) 

.01  .25 

.05  .25 

•1°  .50 

1.00 

It  has  previously  been  determined  by  the  method  shown 
in  Chapter  11  that  the  best  sample  plan  is  (16,1)  from  a 
production  run  of  500  pieces.  A  sample  is  taken  and  two 
defectives  found.   How  should  the  original  prior  probability 
distribution  be  revised  and  what  action  should  the  manufacturer 
take? 

Two  defectives  in  a  sample  of  16  is  a  conditional 
probability  in  that  it  depends  on  the  fraction  defective, 
p.   Using  the  binomial  table  to  compute  the  donditional 
probability  of  getting  two  defectives  in  a  sample  of  16  we 
get  the  following  results: 


i 
This  example  is  derived  largely  from  Bierman,  Harold,  Jr., 

Fouraker,  Lawrence  E.,  and  Jaedicke,  Robert  K.  Quantitative 

Analysis  .for  Business  Decisions.  Homewood,  Illinois: 

Richard  D.  Irwin,  Inc.  1961.  Pp.  161-175 
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Table  12.4 


Fraction      Prior  marginal  prob-      Conditional 
defective,  p  abilities  known  before    Probabilities 
sample  was  taken,  P(p)     from  Sample 


irom  sample 
P(r  =  2  [  16.  p) 


.01  .25  .0104 

.05  .25  .1463 

.10  .50  .2745 

1.00 

We  can  now  compute  the  .joint  probability  for  each 
fraction  defective,  1.  e.,  the  probability  of  a  .01  de- 
fective batch  of  raw  material  and  finding  two  defectives 

in  a  sample  of  16,  etc. 

Table  12.5 

Fraction  Defective,  p  Prior  Prob.  Cond.  Prob.     Joint  Prob. 
li£) P(r  =  2  |  16,  p)  P(r  =  ,V2.p) 

.01  .25  .0104  .0026 

.05  .25  .1463  .0366 

.10  .50         .2745  .1373 

1.00  .1765 

The  sum  .1765  is  the  marginal  probability  and  is  the 
probability  of  getting  two  defectives  in  a  sample  of  16 
from  this  raw  material. 

As  a  result  of  the  sample  information  we  would  want  to 
revise  the  prior  probabilities.  V/hat  we  are  looking  for  is 
the  probability  of  the  various  fraction  defectives  given  a 
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sample  of  16  in  which  we  found  two  defectives.   This  is  a 
conditional  probability  and  can  be  found  in  the  same  manner 
as  above  when  we  wanted  the  probability  that  a  ball  was 
striped  when  we  had  prior  information  that  it  was  red;  in 
that  case  we  divided  the  joint  probability  .1  by  the 
marginal  probability  .4  to  get  the  conditional  probability 
.25.   Using  the  same  method  we  would  get  the  following 
revised  (conditional)  probabilities  in  the  quality  control 
problem: 

Table  12.6 


Fraction 
defective, 


Cond. 

Prior    Prob.  Joint 

Prob.    P(r=2|  Prob. 

Uj>l 16. p)  P(r=2.p) 


Rev.  Prob. 
P(p|r=2) 


.01 

.05 

.10 


.25 
.25 

.50 


.0104 
.1463 


.0026 
.0366 


♦0026  =  m0lk? 

•  J.  (  O.P 


1.00 


.2745     .1373 


^1765 


4fr  =  '7?80 
,JVO:?      1.0001 


Prior  probabilities  refer  to  the  probabilities  assigned 
to  a  basic  random  variable  before  a  sample  is  taken  and 
posterior  probabilities  refer  to  the  revised  probabilities 
as  a  result  of  sample  information. 
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Problems 

1.   A  manufacturer  assigns  the  following  prior  probabilities 
to  the  percent  defective  in  a  production  process. 

Event  Prob.  of  Event 

.01  defective  .20 

.05   "  .30 

.10  "  .50 

The  rework  cost  of  a  defective  is  5°  cents  and  the 
length  of  the  run  is  1,000  pieces. 

a)  Compute  the  expected  cost  of  using  the  process 
for  a  run  of  1,000  pieces. 

Solution: 

Fraction  . Cost 

Defective, 

p Prob.  P(p)  Conditional Expected 

.01         .20  $.50  x  .01  x  1,000  ■  $5.00  $5.00  x  .2  =  $1.00 

.05  .30      .50  x  .05  x  1,000  =  25.00  25.00  x  .3  =    7.50 

•10         .50   .50  x  .10  x  1,000  -  50.00  50.00  x  .5  =25.00 
1.00  $33.50 

b)  A  sample  of  10  pieces  is  run  off  and  two  de- 
fectives are  found.   Revise  the  probabilities 
to  take  account  of  the  new  information. 
Compute  the  expected  cost  of  using  the  process. 

^■Problems  1  and  2  are  derived  from  Blerman,  Harold,  Jr., 
Fouraker,  Lawrence  E.,  and  Jaedicke,  Robert  K.  Quantitative 
Analysis  for  Business  Decisions.  Homewood,  Illinois:  Richard 
D.  Irwin,  Inc. ,  196I  Pp.  178-179;  problem  3  from  Mos teller, 
Frederich,  Rourke,  Robert  E .  K. ,  and  Thomas,  George  B.,  Jr. 
Probability  with  Statistical  Applications.  Reading,  Mass: 
Add is on -Wesley  Publishing  Co. ,  Inc. ,  1961  P.  1^8  and  problem 
^  from  Wadsworth,  George  P.  and  Bryan,  Joseph  G.  Introduction 
££  Probability  and  Random  Variably.  N«w  ¥orki  MoOraw-Ktll, 
I960.  P.  32 

12-6 


vo 

o 

vn 

H 

c> 

■3- 

vo 

^J- 

o 

«H 

OVT\ 

• 

• 

• 

• 

^" 

-3" 

■3- 

*& 

<©= 

-p 

W 

II 

II 

II 

o 

o 

o 

o 

O 

o 

o 

o 

tJ 

• 

• 

• 

0) 

v> 

VO 

o 

•p 

«* 

<V1 

VT\ 

o 

<u 

M 

X 

K 

a 

H 

CM 

VO 

(^ 

w 

C>- 

v>v 

O- 

o 

VO 

CM 

o 

H 

CD 

■p 

c 
o 
o 


o 

■P 

H 
O 

w 


6 
H 
O 


^-v 

•o 

CM 

a> 

II 

CO 

i^ 

•H 

,n 

> 

o 

P 

0) 

u 

^_* 

rd  (x,  pl, 

-p 
o 

•-3 


^-* 

p 

m 

Prob 
P(r=2 

• 

o 


•      cvj 

■d^  il 

C  O^f-i 

o  ^M 

O  PL,  PL, 


op- 

■H    O    P 
PL,  PM  PM 


£2  > 
O  «H 
•H  +> 

■P   O 

o  a)  p 

£2 


CM 

o 
o 


CO 

o 
o 
o 


vo 
vrv. 
vO 
H 


CD 
C*-v 
On 


CM       VO 

O       vo 
O        O 


CM 


CM 
CD 


OH 
O-O 


o 
o 


w^c- 


CD 
VO 


O 
O- 


O 


CN 

en 

Ov 


O      vr>o 


w,      O 
O        H 


I 
CM 


2.  A  manufacturer  produces  a  special  type  of  gear  casting. 
His  regular  machine  Is  In  need  of  repair,  but  he  has  a 
standby  machine  which  In  the  past  has  had  the  following 
defeot  record: 

%   defective  Historical  frequencies 
.01  100 

.05  100 

.10  800 

For  the  current  run  of  500  parts  the  manufacturer  can 
either  subcontract  the  work  at  a  cost  of  $10.09  per 
part  or  use  the  standby  machine  at  an  Incremental 
cost  of  production  of  $10.00.   The  cost  to  rework  a 
defective  piece  Is  $5.00. 

a)  What  Is  the  best  act  without  sampling  If  the 
historical  frequencies  are  considered  prior 
probabilities? 

What  Is  the  opportunity  loss  of  each  act? 
Solution:  Cost  due  defectives 

Fraction  Defective,  p  Prob  P(p)  Conditional    Expected 
.01  .1        $5.00       $   .50 

.05  .1       25.00        2.50 

.10  .8        50.00        40.00 

1.0  $43.00 
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Expected  cost  of  manufacture! 

500  x  $10.00  +  $43.00  =  $5,043.00 
Cost  of  subcontract! 

500  x  10.09  =  $5f045.00 

Therefore  best  act  is  to  manufacture. 


Conditional  Costs    Opportunity  Losses 


Sub- 

Sub- 

Event 

Manufacture 

contract 

Manufacture 

contract 

.01  defective 

$5,005 

$5,045 

0 

$40 

.05  defective 

5,025 

5,045 

0 

20 

.10  defective 

5,050 

5,045 

$5 

0 

t>)   What  is  the  best  rejection  number  for  a  sample 
size  20? 
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3.  In  a  factory  machine  A  produces  30%   of  the  output, 
machine  B  produces  2$% ,  and  machine  C  produces  the 
rest.  One  percent  of  the  output  of  machine  A  is 
defective,  1.2$  of  B»s  and  '2%   of  C's.   In  a  day's 
run  the  three  machines  produce  10,000  items.  What 
is  the  probability  that  a  defective  item  drawn  at 
random  from  a  day's  output  was  produced  by  each 
machine? 

Answer:  A-  .2,  B-  .2,  C-  .6 

4.  An  insurance  company  found  that  depositors  who  have 
sufficient  funds  in  their  checking  accounts  postdate 
a  check  by  mistake  once  in  a  thousand  times,  while 
depositors  who  have  insufficient  funds  invariably 
postdate  their  checks.  When  a  postdated  check  is  re- 
ceived at  the  bank,  what  is  the  probability  that  it 
was  written  by  a  depositor  with  insufficient  funds 
if  these  depositors  are  one  percent  of  the  total? 
Answer:   .91 
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CHAPTER  13 

Suspension  of  Judgment  and  Summarization  of  Information 

i 
13.1  Analysis  by  Test  of  Significance 

Referring  to  the  electron  tube  problem  of  Chapter  11, 
suppose  a  sample  of  18  tubes  has  been  taken  and  r0  =  8. 
Is  the  evidence  derived  from  this  sample  sufficient  to 
itfarrant  a  definite  conclusion  on  the  part  of  the  manufactur- 
er or  should  he  take  another  sample  before  deciding  that 
p  A  38  and  use  grade  B  cobalt  or  that  p  ^.38  and  use  grade 
A  cobalt.   By  taking  the  breakeven  value  and  multiplying  by 
the  sample  size,  n  =  18,  p^n  =  .38  x  18 

=  5.3--,  but  r  must  be  discrete 
so  we  use  r  =  7.   Substituting  in  ?(r  =  >cn  -  d0|  n,^) 
+  P(£  ^  pbn  +  d0|n,  Pb)  it  follows  that 

P(r  ^  6|n  =  18,  p  =  .33)  =  A&2b 
P(r  ^  8|n  =  18,  p  =  .38)  =  .3681 


.8105 
Similar  to  the  example  in  13.2.1  in  Schlaifer,  o(    would 
have  to  have  been  set  at  .8105,  so  the  decision  should  be 
to  take  another  sample.   The  probability  of  getting  6  or 
less  or  8  or  more  defectives  in  a  sample  size  of  18  at  the 
breakeven  value  of  p  is  .8105.   This  is  also  the  probability 
of  not  getting  7  defectives. 


1 

Robert  Schlaifer,  Statistics  for  Business  Decisions 

McGraw-Hill  Book  Co.,  Inc.  19&1.  p.  199. 
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13.2  Revision  of  prior  probabilities  .given (rp  =  8,  n  =  13,  pj 

Having  obtained  8  defects  in  our  first  sample,  we  can 
revise  out*  prior  probabilities  with  the  information  obtained 
from  this  sample  and  go  through  another  analysis  similar  to 
that  described  in  Chapter  11,  to  see  if  it  is  more  economical 
to  shift  to  grade  A  cobalt  now  or  take  another  sample  based 
on  the  revised  probabilities.   Table  13-1  shows  the  compu- 
tation of  the  revised  probabilities.   Table  13-2  shows  the 
computation  for  decision  rule  20,7.   Table  13-3  shows  the 
computation  for  decision  rule  19,7  and  20,8.   Comparing  the 
UETL  from  these  three  decision  rules,  it  is  readily  seen 
that  decision  rule  (20,7.)  is  tl*.  -  best  rule  of  the  three, 
however  we  stilj  don't  know  if  it  is  the  best  rule  for 
another  sample  since  we  have  not  tested  dacision  rule  (21,7). 
Here  is  one  of  the  limitations  to  an  incomplete  set  of 
binomial  tables  such  as  Schlaifer's  Table  I. 

Table  13-1 

Computation  of  Revised  Probabilities 

based  on  (n  =  18,  p  =  p,  rQ  =  8) 

p    P(to)  'x  .  P(r  =  8ll8,p)      -  (r  =  8,p)  P(pjr  =  8) 

.00004  .0003 

.0018  .0147 

.0358  .2940 

.0653  .5370 

.0187       .15^0 
.12164     1.0000 
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.12 

.08 

X 

.0005 

.20 

.15 

X 

.0120 

.35 

.27 

X 

.1326 

.45 

.35 

X 

.1863 

.55 

.15 

X 

.1247 

Table    13-2 
Decision  rule    (20,7) 


p 

P(P|r  = 

8) 

Cond. 
Loss 

Column 

4        X 

P(wd) 

= 

EOL 

.12 

.0003 

X 

|26,000  = 

I    7.80 

.0067 

= 

$ 

.05 

.20 

.0147 

X 

18,000  = 

264.60 

.0867 

= 

22.95 

.35 

.2940 

X 

3,000 

882.00 

.583^ 

= 

514.00 

M 

.5370 

X 

7,000 

3,759.00 

.1299 

= 

488.00 

.55 

.1540 

X 

17,000 

2,618.00 

.0214 

— 

56.00 

Si 

,081.00 

Cog 

t  3f  Sampling 
USTL 

2 

,000.00 

$3 

,081.00 
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Table  13-3 
Decision  rule  19,7 


Column  4        P(wd) 


.12             $    7.80   x    .0048    =  §     .04 

.20                264.60   x    .0676    =  17.90 

.35                882.00   x    .5188    =  457.00 

.45              3,759.00   x   .1727    =  649.00 

.55                                   2,618.00   x    .0342    =  89.50 

§1,213.44 

Cost  of  sampling  1,900.00 

UETL  $3,113.^ 


Decision  rule  20,8 


Column  4        ?(v;d) 


.12  $    7.80  x    .0004  =  §     .03 

.20  264.60  x     '21  =  8.50 

.35  882.00  x    .3990  =  352.80 

.45  3,759.00  .2520  =  947.00 

.55  2,618.00  .0580  =  152.00 


$1,460.33 

Cost  of  sampling     2,000.00 

UETL        $3,^60.33 
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In  order  to  test  decision  rule  (21,7)  a  more  complete  set 

of  binomia,!  tables  must  be  used.   If  they  are  not  immediately 

available ,  they  can  be  calculated  by  converting  values  de- 

rived  from  Table  I,  Schlaifer. 

13-3  Calculation  of  a  Probability  when  the  sample  size  is 

increased  by  1. 

It  is  possible  to  use  the  Table  I  in  Schlaifer  to 
compute  the  P(r  =  rjn  +  ,1,p).   For  example  from  Table  I, 
P(r  =  7 | 19 ,  .38)  is  .i860.   The  cuestion  is  how  can  we 
calculate  P(r  =  7 | 20,. 33).  Proceed  as  follows: 

P(r  =  7  1 19,. 38)  =  C^9  p7  c12  (1) 

.1860  =  12! x  (.3^:  ;.62)12  (2) 

7!  12! 

P(r  =  7|20,.38)  =  — x  (.3S)7(.62)13  (3) 

75  13? 

=    20xl9!   (.33)?(.62)12(.62)      W 

7'.    13  x  12! 


20  x  .62       19 


13        72  12! 


x  (.38)?(.62)12  (5) 


P(S«  =  7|20,.38)  =  20  x  *62  (.1860)  (6) 

13 

=  .177^  (7) 

From  Table  I  ue  read  the  same  .1774.   Therefore,  by 

choosing  the  nearest  va.lue  in  the  table,  other  values  not 

given  in  a  set  of  binomial  tables  may  be  computed  in  this 

same  manner.   The  farther  away  from  the  table  value,  the 

more  complex  is  the  calculation  however.   To  get  the 
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cumulative  probability,  for  example,  P(r  =  7J21,  .12), 

one  would  have  to  calculate  each  P(£  =  p,  1,  2,  3,  kt    5» 

6  1 20,. 38)  then  sum  these  values  and  subtract  from  1.0000. 

This  process  can  become  very  laborious.  A  complete  set  of 

binomial  tables  is  far  more  convenient.   Glable  13-^  is  an 

example  of  such  a  calculation  for  P(r  ^  7 1 21,. 38). 

Table  13-4 

Calculation  of  P(r  =  7  I  21,. 12)  from 

P(r  =  0,  1,  2,  3,  lK    5,  6[20,.38) 

Conversion 
r P(£  =  r|20,.12) Factor       P(ft  =  rJ21,.12) 

0  .0776     x     v88  *  21        .0683 

1  .18^     x     ^8x21        -1702 

20 

2  .26*1-0     x  *88  ?q21         .2270 

3  .22^2    x    *88  x  21       .2310 

lo 
k  .1300     x     jl38x21        mli>13 

5  •     .0567     x    '88  x621       .0656 

6  .01;;,-      x    *88  *  21       .0165 


P(£  ^  7I21,  .12)  =  .9^99 
then  P(r  ^  7I21,  .12)   =  .0501 


Table  13-5  lists  the  values  of  P(wd)  for  decision  rule 
(21,7)  calculated  by  the  method  just  illustrated. 
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Table  13-5 
Probability  (wrong  decision  21,7) 


P(wd) 


.12  .0501 

.20  .2017 

.35  .641? 

.^5  .0964 

.55  .0130 


Table  13-6  is  the  summary  of  UETL's  calculated  for 
four  decision  rules.   The  best  decisis.:  -  ;,le  based  on  a 
revision  of  prior  probabilities  is  (20,7).   To  determine 
whether  or  not  the  manufacturer  should  take  this  second 
sample,  we  must  compare  the  Unconditional  Expected  Terminal 
Loss  from  decision  rule  (20,7)  with  the  Expected  Value  of 
Perfect  Information  based  on  the  rosterior  probabilities 
computed  in  Table  13-1.   Table  13-,  portrays  the  "£::pected 
Cost  under  Certainty  for  the  posterior  probabilities. 
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Table  13-6 
Summary  of  UETI's 


Decision  Rule 


UETL 


19,7 
20,7* 

20,8 

21,7 


3,081.00* 

3,460.33 

3,114.79 


*Best  rule 


Table  13-7 
Expected  Cost  under  Certainty 


Best  Cond. 

Cost  Given 

each 

Expected 

p 

p(p) 

event 

BEST  ACT 

Cost 

.12  .0003  §12,000 

.20  .01*1-7  2.0,000 

.35  .2940  35,ooo 

.45  .5370  38,000 

*55  .1540  38,000 


Grade 

B 

$           3.60 

Grade 

3 

294.00 

Grade 

3 

10,290.00 

Grade 

A 
ft. 

20,406.00 

Grade 

._ 

5,852.00 

.^36,845.60 


The  cost  of  -j  certainty  is 

,,33,000  -  $36,846  =  $1,154 
and  this  sets  the  Expected  Value  of  Perfect  Information 
to  the  manufacturer.   By  comparing  the  EVPI  with  the  UETL 
of  decision  rule  (20,7),  i.e., 
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EVPI  =  81,15^ 

UETL  =   3,031, 
it  is  apparent  that  the  manufacturer  should  not  take  the 
second  sample,  but  switch  to  grade  A  cobalt  and  complete 
the  run  since  the  USTL  exceeds  the  EVPI  by  $1,927, 
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1 

Problem 

— — — — — 

13-1  A  manufacturer  assigns  the  following  prior  probabilities 
to  the  per  cent  defective  in  a  production  process  considered 
Bernoulli: 


Event 

Probj 
of 

ability 
Event 

.01  defective 

.20 

.05  defective 

.30 

.10  defective 

....50 

1.00 

The  rework  cost  of  a  defective  is  50  cents;  the  length 
of  run  is  1,000  pieces. 

a)  Compute  the  expected  ccl.^   g.  ...  ing  the  process 
for  a  run  of  1,000  pieces. 

b)  A  sample  of  ten  pieces  is  run  off  and  two  defectives 
are  found.   Revise  the  probabilities  to  take  account 
of  the  new  information.   Compute  the  expected  cost 
of  using  the  process. 


i 
Harold  bierman,  and  Others,  Quantitative  Analysis  for 

Business  Decisions,  p.  179,  Problem  12-8. 
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CHAPTER  Ik 
Measures  of  Variability; 
Variance  and  Standard  Deviation 

14.1  General 

There  Is  a  common  propensity  for  like  items  to  locate 
themselves  about  a  central  point.   In  Chapter  Six,  we  dis- 
cussed briefly  some  measures  of  central  tendency;  namely, 
the  mean,  median  and  mode.   But  measures  of  inequality 
such  as  distribution  of  income,  IQ  of  students  at  the  U.  S. 
Naval  Postgraduate  School,  height  by  age  groups  of  Japanese 
people,  educational  level  of  the  Russian  people,  etc.,  are 
problems  requiring  the  measurement  of  variability. 

While  the  mean,  median  and  mode  describe  respective 
central  locations  in  a  distribution,  they  tell  us  nothing 
about  the  variability  of  a  distribution.   For  example,  if 
we  knew  that  the  mean  depth  of  a  river  was  one  foot,  we  had 
better  not  try  wading  across  until  we  find  out  what  the 
extreme  depths  are,  for  the  mean  does  not  really  tell  us 
much  about  the  river  depth.  What  we  want  to  know  is  how 
representative  is  the  mean;  how  is  the  distribution  spread 
around  the  mean?  What  is  the  "spread",  "scatter",  "dispersion1 
or  variability?  The  answer  to  this  question  gives  rise  to 
two  new  terms:  the  variance  and  its  square  root,  the  standard 
deviation. 

14.2  The  Variance 
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The  difference  between  the  mean  and  an  individual 
value  in  a  distribution  is  the  deviation  of  that  value. 

The  variance  is  found  simply  by  (1)  averaging  the 
squares  of  all  deviations  in  the  distribution  or  (2)  by 
summing  the  weighted  (i.e.,  relative  frequency)  deviations 
squared . 

Using  the  data  of  Table  6.1  Hypothetical  Distribution 
of  Lieutenants  in  the  Navy,  we  know  that  the  mean  of  the 
random  variable,  Z  is  27;  we  can  compute  the  variance, 
0"  (Z),  in  either  of  two  ways  as  follows: 

Table  14.1 


of 

ndcmj 
le  Z 

De- 
viation 

De- 
viation 
Squared 

Ave.  of 
deviations  (1) 

Wt*d.  ave. 
method  (2) 

Value 
the  ra 
var  iab 

No.  of 
occur- 
rences 

Pro- 
duct 

Rel. 
Freq 

Product 

23 

-4 

16 

750 

12,000 

.05 

.80 

24 

-3 

9 

750 

6,750 

.05 

M 

25 

-2 

4 

1,500 

6,000 

.10 

.40 

26 

-1 

1 

2,250 

2,250 

.15 

.15 

27 

0 

0 

3,750 

0 

.25 

0 

28 

+1 

1 

3,000 

3,000 

.20 

.20 

29 

+2 

4 

1,500 

6,000 

.10 

.40 

30 

+3 

9 

750 

6,750 

.05 

.45 

31 

+4 

16 

7$o 
15,000 

12,000 
54,750 

.05 
1.00 

.80 

3.65 

Var  is 

mce  =  <F 

(Z)  = 

54,750  ,  3 

,65 

15,000 
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1^.3  The  Standard  Deviation 

The  Standard  Deviation  is  simply  the  square  root  of 
the  variance. 

In  the  above  example,  Standard  Deviation 
(T(Z)  =  3.65  =  1.91  years 

The  variance  and  standard  deviation  are  measures  of 
the  variability  of  a  distribution  which  have  not  previously 
been  used  in  the  basic  text.   Although  they  are  not  the  only 
measures  of  variability,  they  are  the  only  ones  suitable  for 
the  sampling  type  problems  found  in  the  subsequent  chapters 
of  the  text.   Furthermore,  we  have  to  this  point  been  con- 
cerned only  with  the  binomial  distribution  which  has  a 
serious  limitation  in  that  it  requires  an  unweildy  number  of 
tables  in  order  to  be  used  In  all  problems.   This  limitation 
can  be  met  by  use  of  the  Normal  approximation  to  binomial 
probabilities;  complete  Normal  probability  tables  occupy 
only  one  page.   The  term  "Normal"  also  extends  to  the  dis- 
tribution itself  which  has  certain  characteristics  that  are 
described  in  the  next  chapter.   It  also  has  some  limitations; 
in  general,  in  order  to  use  the  Normal  distribution  as  an 
approximation  to  the  binomial, 

p  x  n  ^  5 
q  x  n  ^  5 
That  is,  the  mean  long  run  expected  value  of  successes  and 
failures  in  the  Bernoulli  process  should  be  equal  to  or 
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greater  than  five. 
Problems : 

1.  Find  the  approximate  mean,  variance  and  standard 

deviation  by  two  methods  of  wages  in  the  following 

group  of  500  workers. 


WAGES 

NO.  OP  WORKERS 

40  to 

$  60 

60 

60  " 

80 

70 

80  " 

100 

85 

100  " 

120 

110 

120  " 

140 

95 

140  " 

160 

80 

500 

Answer:  Mean  $104,  Variance  $1,020,  Standard 
Deviation  $31.90. 
2.   One  automatic  screw  machine  is  used  to  produce 
shafts  and  another  to  produce  bushings  one  of 
which  is  assembled  over  each  shaft.  The  mean 
diameter  of  the  shafts  is  1.000  inch;  the  mean 
inside  diameter  of  the  bushinp-s  is  1.002  inches; 
and  the  Standard  deviation  of  either  set  of 
diameters  is  .001  inch.   Compute  the  mean  clearance 
(bushing  diameter  minus  shaft  diameter)  and  the 
standard  deviation  of  the  distribution  of  clearance 
on  the  assumption  that  assembly  is  random— i.e., 
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bushings  are  not  selected  to  fit  the  shafts 

on  which  they  are  placed. 

Answer i 

Mean  clearance,  .002;  stand  deviation, 

.OOHfHf1 


1Thls  problem  is  from  Sohlaifer,  Robert.  Probability  and 
Statistics  for  Business  Decisions.  New  Yorki  McGraw  Hill 
Book  Company  Inc.,  1959.  P.  272 
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CHAPTER  15 

The  Normal  Distribution  and  the  Normal  Approximation  of 
1 ^ 

the  Binomial' Distribution 

15-1  Approximating  the  Binomial  Distribution 
P(r  =  r  n  -  50,  P  =  .5).  §J 

The  normal  distribution  is  an  excellent  model  and 
useful  addition  to  the  statistician's  tool  kit.   It  can 
be  used  to  approximate  the  binomial  distribution.   It 
should  be  remembered  that  the  binomial  distribution  is  an 
exact  discrete  distribution.   The  Normal  distribution  is 
a  continuous  distribution,  so  in  order  to  approximate  the 
binomial  distribution,  it  must  be  made  discrete  by  approximat- 
ing a  binomial  histogram.   For  example,  take  the  binomial 
distribution  P(£  —  r|  n  =  50,  P  -  .5) $   how  can  we  approximate 
this  distribution  with  the  Normal?  Proceed  as  follows: 

n  =  50         CT*~   =  npc 


P  =  .5         (p     -   ,  ...  _ 
q  =  .5  T    m   i/50  x  .5  x  .5 

E(r)  =  vn  CT   =  3.53 


_._-,  =  .5  x  50 
2(r)   =  25 
For  the  Normal  distribution, 


u 


-  r 


^M   . 


3.53 

By  calculating  the  values  of  u  and  plugging  Table  III 
in  Schlaifer,  the  data  in  column  3,  Table  15-1  was  derived. 
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Column  2  is  comparative  data  taken  from  Table  I  in 
Schlaifer.   Notice  how  very  close  the  Normal  distribution 
anoroximates  the  binomial  under  the  particular  parameters 

of  (n  =  50,  P  =  -5). 

Two  examples  of  the  calculations  are  given  below 

with  Fig.  15-1  illustrating  the  logic. 

(1)  For  P(r  *  18|50,.5) 

3.53 

u  =  -  2.12 
From  Table  III  in  Schlaifer,  read  .01?00.  Then  1.0000- 

.01700  =  .9830. 

Fig.  xj  1 


-r 

■' 

^s 

. 

f_. 

p\ 

■  '' 

^Ss^ 

1-  .  Ilfe^ 

17.  S    /S.S' 

25 

"                                       1 

zs 
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?rom  Fig.  15-1 ,  it  can  be  seen  that  the  shaded  area  is  what 

is  desired.   The  unshaded  area  is  read  directly  from  Table 

III.   Since 'the  total  area  under  the  curve  is  1.0000,  to 

obtain  the2  shaded  area,  all  that  is  necessary  is  to  subtract 

the  unshaded  area  from  1.0000.   As  Schlaifer  has  pointed  out, 

the  Normal  curve  is  symmetrical  with  respect  to  the  mean  so 

that  it  is  only  necessary  to  provide  tables  for  half  the 

Normal  distribution  and  the  other  half  can  readily  be 

derived. 

(2)  For  P(r  ^  25| 50,  .5) 
24.5  -  25 


u  = 


3.53 


u  =-.1416 

Enter  Table  III  in  Schlaifer  read  and  interpolate  .4439, 
then  P(r  ^  25 150, .5)  =  (.5000)  -  .4439  +  .5000 
P(r  ^  25|50,.5)  =  .5561 
although  Table  15-1  portrays  ?(r  -  r)  for  r  from  12  to  39, 
one  should  rea?  z-.    that  there  are  binomial  probabilities 
for  r  =  0,    1,  2,  3,  ...  11,  but  they  are  so  small  as  to  be 
almost  trivial.   For  example 

P(r  =  0|5O,.5)  =  ^oT*    >°(-5)50 

P(r  =  0|5O  ,.5) 
=  .000  000  000  000  001  7?6  356  839  400  250  464  677  810  668  945  312  5 
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Table  15-1 


Binomial 

Normal 

r 

fix  *   r|50,.5) 

P(?  ss  r|50,.5) 

m 

.fference 

12 

1.0000  -  e  * 

.9999' 

.0001 

13 

.9998 

.9998 

.0000 

14 

.9995 

.9994 

- 

.0001 

15 

.9987 

.9985 

- 

.0002 

16 

.9967 

.9964 

- 

.0003 

17  . 

.9923 

.9919 

- 

.0004 

18 

.9836 

.9830 

- 

.0006 

19 

.9675 

.9671 

- 

.0004 

20 

.9405 

.9406 

+ 

.0001 

21 

.8987 

.8988 

+ 

.0001 

22 

.8389 

.8389 

.0000 

23 

.7601 

.7601 

+ 

.0001 

24 

.  6641 

.  6642 

_ 

.0001 

25 

.5561 

.5561 

.0000 

26 

.4439 

.'439 

.0000 

27 

.3359 

.3359 

.0001 

28 

.2399 

.0000 

29 

•  loll 

.._  '11 

.0000 

30 

.1013 

-I  --to 
.  iv. 

- 

.0001 

31 

.0595 

.0596 

+ 

.0001 

32 

.0325 

.0329 

.0004 

33 

.0164 

.0170 

+ 

.0006 

34 

.0077 

.0080 

+ 

.0003 

35 

r>  a  **>  0 
•  ^'033 

.0036 

+ 

.0003 

36 

*  *J 

.0015 

+ 

.0002 

37 

ft   - 

+ 

.0001 

38 

. .  -  . 

.0001 

39 

.0000 

* 

+ 

.0001 

*  a  very  small  number. 
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when  p  =  .5  the  Binomial  distribution  is  symmetrical  with 
respect  to  E(r),  and  the  Normal  distribution  approximates 
it  even  when  n  is  fairly  small  for  example  n  =  10.  Table 
15-2  illustrates  this  point. 

Table  15-2 


Binomical 
P(r  *  r|10,.5) 


Normal 
P(f  a  r|10,.5)  Difference 


0 
1 
2 

3 
4 

5 
6 

7 

8 

9 
10 


1.0000 
.9990 
.9893 


.1719 
.  054? 

.010? 
.0010 


.9997 

- 

.0003 

.9991 

+ 

.0001 

.9864 

- 

.0029 

>-29 

- 

.0024 

-  r- 

+ 

.0008 

+ 

.0010 

.3760 

- 

.0010 

.1711 

- 

.0008 

+ 

.0024 

.0136 

+ 

.0029 

.0009 

- 

.0001 

When  p  or  q  are  small  the  bino..  -     .stribution  skews 
right  or  skews  left  and  the  N.ormal  distribution  approxims  .  .  :n 
is  not  as  good  as  the  Poisson  distribution  which  also  can 
approximate  the  binomial  as  pointed  out  in  Schlaifer.   The 
graphs  in  figures  15.6,  15.7,  and  15. 8  in  Schlaifer  bear 
this  out. 
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Problems : 

15-1  Verify  the  data  in  Table  15-2. 

15-2  A  fair  coin  is  tossed  500  times.   Find  the  probability 
that  the  number  of  heads  will  not  differ  from  250  by  (a) 
more  than  10,  (b)  more  than  50. 

15-3   If  10  per  cent  of  television  picture  tubes  burn  out 
before  their  guarantee  has  expired,  (a)  what  is  the  prob- 
ability that  a  merchant  who  has  sold  100  such  tubes  vrill 
be  forced  to  replace  at  least  20  of  them?   (b)   What  is 
the  probability  that  he  will  replace  at  least  5  and  not 
more  than  15  tubes?  Use  the  normal  curve  approximation. 


-j 

■"■i'lurray  R.  Spiegel,  Theory  and  Problems  of  Statistics. 


(Schaum  Publishing  Co.,  196TT  Prob.  25,  p.  1357 

2Paul  G.  Koel,  Elementary 
Inc.,  i960)  Prob.  15.  p."  69T 


2Paul  G.  Hoel,  Elementary  Statistics.  (John  Wiley  &  Sons 
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CHAPTER  16 
The  Central  Limit  Theorem  and  Large -Sample  Theory 
16.1  Background 

In  earlier  parts  of  Schlalfer's  text; considerable 
attention  was  paid  to  principles  of  mathematical  probability. 
Indeed,  we  have  periodically  returned  to  these  principles 
throughout  the  course.   One  of  the  most  important  uses  of 
mathematical  probability  in  statistics  is  its  application 
to  sampling  theory.   Not  only  are  sampling  techniques  used 
extensively  in  business  problems  as  stressed  in  the  basic 
text  but  also  in  social,  biological  and  physical  sciences, 
government,  and  humanities.   However,  statistics  and  sampling 
techniques  have  definite  limitations-sample  size  in  relation 
to  the  total  population,  representativeness,  definitions, 
inaccurate  measurement  or  classification,  inappropriate 
comparisons,  disregard  of  dispersion,  etc. — when  used  or 
interpreted  improperly.   In  fact,  a  book  has  been  written 

A 

on  rlow  To  Lie  with  Statlstlcsx  and  at  least  one  text  con- 

2 

tains  a  x-xhole  chapter  on  "Misuses  of  Statistics".   Two 

books  which  became  popular  best-sellers  in  the  1950* s  were 


1Huff ,  Darrell.  How  To  Lie  with  Statistics.  New  York:  W. 
W.  Norton  &  Company,  Inc.,  195*1-. 
2 
Wallis,  W.  Allen  and  Roberts,  Harry  V..  Statistics  a  New 

Approach.   The  Free  'Press  of  Glencoe  Inc.,  1962,  Chapter  3. 
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1 

the  Kinsey  Reports.   But  both  of  these  volumes  have  been  ' 

roundly  discredited  by  statisticians  who  cut  through  the 

» 
sensational  findings  to  examine  how  they  were  obtained. 

The  sampling  techniques  were  so  inadequate  to  most  statis-  / 

ticians  that  confidence  in  the  basic  significance  of  the. 

findings  was  diminished  such  that  the  books  have  little 

value  beyond  their  role  in  opening  a  broad  and  important 

field.2 

Sampling,  therefore,  is  an  important  part  of  statistics. 
How  do  we  go  about  obtaining  samples,  how  representative 
are  they,  how  much  freedom  are  we  allowed  in  interpretation', 
and  mathematical  manipulation,  what  inferences  can  be  drawn 
from  samples,  how  confident  are  we  that  a  sample  represents  ' 
the  true  population  from  which  drawn,  what "is  the  significance 
of  a  sample,  and  what  conclusions  can  be  drawn? 
16.2  Symbols 

Listed  belox-;  are  some  of  the  more  important  symbols 
used  in  sampling  theory: 

x  -  an  Individual  value  in  a  distribution  (example: 
the  wage,  ^2.00/hr.  of  one  man  in  a  factory) 


Kinsey,  Alfred  C, ,  Pomeroy,  V/ardell  B.,,and  Martin, 
Clyde  E.  Sexual  Behavior  in  the  Human  Kale .   Philadelphia 
and  London:   W.  3.  Saunders  Company,  19^8  and  Kinsey, 
Alfred  C.  and  Associates.   Sexual  Behavior  in  the  Human 
Female.  New  York:   E.  P.  Dutton  and  Co.,  1955." 

2Wallis,  W.  Allen.  "The  Statistics  of  the  Kinsey  Report." 
Journal  of  the  American  Statl^a^l  Association.  Vol.  *|4 
(19*1-9)  P.  '1-657 
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x  -  the  mean  of  a  sample  (example:  from  a  sample  of, 

say,  5  men  whose  wages  are  $1.80,  $1.95»  &2.00, 

$2.20  and  £2.25,  in  a  total  of  100  in  a  factory, 

a  mean  wage,  $2. Ok,  is  computed) 
t  -  the  total  sum  of  x's  (individual  values)  in  a 

sample  (example:  the  sum,  #10.20,  of  the  wages 

in  the  sample  of  5) 
U.  -  the  mean  of  all  the  x's  (individual  values)  in' 

a  population  (example:  the  mean  of  all  the  100 

wages  in  the  factory) 
<T  -  the  standard  deviation  of  all  the  x's  (individual 

values)  in  a  population 
d  x  -  the  Standard  deviation  of  the  sample  mean 

(example  developed  below) 
N  -  size' of  a  population,  the  number  of  x's  in  a 

4 

population  (example:  N  =  100  wages  of  workers 

in  the  above  cited  factory) 
n  -  number  of  items  in  a  sample  (example:  n  -  5 

above ) 
16.3  Sampling  with  and  without  replacement 

If  we  have  an  urn  with  a  total  of  100  red,  white ,  and 
blue  balls  in  it  and  start  drawing  out  a  sample  of  12  in 
order  to  estimate  the  true  proportion  of  each,  then  the 
probabilities  of  drawing  each  color  will  change  after  I. 
every  draw.   The  probability  of  drawing  any  one  color  is 
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conditional  upon  its  true  parameter,  p,  and  the  ones 
already  drawn.  This  is  called  sampling  without  replace- 
ment and  often  is  referred  to  as  sampling  from  a. finite 
population.   If  we  replace  a  ball  each  time  it  is  drawn  for 
our  sample  of  12 ,  then  each  draw  has  no  bearing  on  the  one, 
that  comes  after  it  and  the  probability  of  drawing  any  one 
color  is  conditional  upon  its  true  population  parameter 
only.   While  the  size  of  the  population  may  be  finite,  It 
can  be  considered  infinite  because  any  number  of  samples 
can  be  drawn  with  replacement  without  exhausting  the 
population.   There  are  many  practical  problems  in  which 
sampling  is  done  from  a  finite  population  but  is  treated 
as  infinite  because  of  its  large  size. 
16.^  The  Central  Limit  Theorem 

Although  a  distribution  can  have  a  graphical  shape  , 
other  than  Normal,  as  illustrated  in' Figure  16.1  of  the  ' 
basic  text,  most  of  the  problems  we  will  be  concerned  with 
approach  a  Normal  distribution.   In  this  connection,  how- 
ever, there  is  a  tendency  for  sampling  distributions  to 
assume  a  Normal  distribution.  This  is  ordinarily  true 
regardless  of  the  shape  of  the  population  from  which  the 
sample  was  drawn}  and  the  larger  the  sample,  the  more  closely 
does  it  approach  Normality.  This  tendency  is  called  the 
Central  Limit  Theorem  and  its  importance  lies  in  the  fact 
that  it  allows  us  to  apply  the  Normal  distribution  to 
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statistical  sampling  distributions. 

Except  in  the  comparitively  unusual  gamma  and  ex- 
ponential processes,  a  sample  will  approximate  a  Normal 
distribution  if:  (l)  the  sample  itself  is  large  enough 
in  absolute  size  to  approximate  a  Normal  distribution; 
that  is,  n  ^  30  (according  to  most  texts)  and  (2)  if  the 
population  size,  N,  is  large  enough  (at  least  twice  the 
sample  size)  to  make  the  sample  virtually  independent  of 
the  population. 
1°" . 5  The  Finite  Population  Correction 

If  the  sample  is  not  independent  of  the  population; 

that  is,  if  the  sample  takes  in  more  than  5%   of  the 

population,  then  the  finite  population  correction  should 

1 

be  applied,  using  the  formula  as  stated  in  Schlaifer: 

^T(t)  =irjn  j/l-  n 


or  as  stated  in  other  texts: 


r--  — -    J2L. 

U  x  -frT  |f  1-   N 

If  the  sample  is  less  than  $%   of  the  population,  which  is 
the  usual  case,  then  (1)  the  size  of  the  population  will 
not  affect  the  Normality  of  the  sample  providing  the  sample  • 
itself  is  large  enough  and  (2)  the  finite-population  correction 
can  be  disregarded  in  computing  the  standard  deviation  of 
the  Normal  approximation  since  it  is  neglible.  Thus  in 
most  practical  applications  it  is  safe  to  assume  that  the1 
distribution  of  sample  means  is  Normal  with  a  standard 
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deviation  of\l  x  -jt^ 

16.6  Estimating-  the  population  Standard  deviation  and 

9 

degrees  of  freedom 

Thus  far  we  have  seen  that  three  Statistical  theorems 
give  us  virtually  complete  information  about  sampling 
distributions : 

1)  The  central  limit  theorem  states  that  a  sample 
will  be  approximately  Normal  if  large  enough 

(  n  ^  30)., 

2)  The  mean,  x,  of  the  sample  means  is  always  equal 
to  the  population  mean;  that  is, 

x  =  4Azl  =Il  -  2T(x)  ,  where  the  first 
n    /      I'J 

sum  is  the  values  in  the  sample  and  the  second 

sum  is  the  values  in  the  population, 

3)  The  Standard  deviation  of  the  sample  means  is 
always  a  certain  multiple  of  the  standard  de- 
viation of  the  population;  that  is, 

Thus,  both  the  mean  and  standard  deviation  of  the 
sample . depend  on  the  corresponding  values  of  the 
population.  However,  in  most  practical  situations 
these  values  are  not  known.   In  fact,  one  reason 
for  sampling  is  to  make  an  estimate  of  what  these 
values  are  in  the  population. 
The  above  theorems  hold  true  for  large  samples 
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(n  -  30),  but  not  for  small.   This  can  be  seen  intuitively 

if  you  consider  a  sample  of  size  one  which  obviously  would 

not  be  equal  to  the  population  mean  except  by  coincidence. 

A  sample  value  is  likely  to  be  closer  to  its  own  mean 

(particularly  when  the  sample  is  small)  than  to  the 

population  mean.   This  again  is  intuitively  apparent,  for 

a  small  sample  will  not  likely  reproduce  the  ranp;e  of  a 

population.   Consequently  the  standard  deviation  of  a  small 

sample  will  tend  to  be  lower  than  the  standard  deviation 

of  the  population.   This  factor  can  be  compensated  for  by 

applying  a  correction  -to  the  sample  in  order  to  make  it 

a  more  realistic  estimate  of  the  population.   Since  the 

standard  deviation  of  the  sample  tends  to  be  smaller  than1 

that  of  the  population,  we  want  to  enlarge  it;  and  this 

can  be  done  by  using  a  larger  divisor  than  n  in  the  formula 

for  the  standard  deviation  of  a  sample;  thus  for  small 

samples  the  estimate  of  the  population  standard  deviation, 

S  fe  -  x)2"  , 
J     n-1 

The  degrees  of  freedom  are  the  number  of  items  in  a  sample 

that  are  effective  in  estimating  a  specified  characteristic 

of  a  population.   Using  the  factor  (n-l)  results  in  one 

degree  of  freedom  being  lost.  Examining  the  above  formula 

it  can  be  seen  (as  previously  stated)  that  decreasing  the 

denominator  from  n  to  (n-l)  will  increase  the  standard 
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deviation  when  the  sample  is  small  but  make  little  difference 
when  it  is  large.   For  most  practical  purposes  we  can  con- 
sider that  the  standard  deviation  of  the  population,  (f,   has 
the  same  value  as  its  estimate,  s,  providing  that  the  sample 
from  which  s  is  obtained  has  at  least  25  degrees  of  free- 
dom. 

1 
Problems 

1.  A  sample  of  ten  college  graduates  earned  a  mean 
of  $2,000  per  year  more  than  the  average  earnings 
of  high  school  graduates  with  a  standard  deviation 
of  C?500  more.  •  What  are  the  95  per  cent  confidence 
limits  of  the  mean  amount  by  which  all  college 
graduates'  annual  income  exceeds  the  mean  annual 
earnings  of  high  school  graduates? 

Answer:  $1,623-02,377. 

2.  A  man  claims  he  is  a  170  bowler.   For  ^9  games  he 
has  a  mean  score  of  165  with  a  standard  deviation 
of  21.   If  you  reject  his  claim,  what  is  the  prob- 
ability that"  you  have  made  a  mistake? 


Problems  1-^,8,9  and  10  are  taken  from  Perlman,  Richard. 
Problems  in  Statistics  for  Economics  and  Business  Students. 
New  York:.  Holt,  Rinehart  and  Winston,  Inc.,  1963;  problems 
5-7  are  from  Hosteller,  Frederick,  Rourke,  Robert  E.  K. , 
and  Thomas,  George  B.  Jr.  Probability  with  Statistical 
Applications.   Reading,  Mass:  Addis on -Wesley  Publishing 
Company,  Inc.,  196l;.and  problem  11  from  Neter,  John,  and 
Wasserman,  William.  Fundamental  Statistics  for  Business 
and  Economics.  Boston*  Ailyii  and  Bacon,  Inc. ,  19 61. 
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Answer :  . 0^8 

3.  A  company  conducting  surveys  of  consumer 

acceptance  of  new  product  pays  its  field  force 
by  the  interview.   It  suspects  that  the  inter- 
viewers, who  average  20  interviews  per  day,  are 
spending  too  little  time  on  interviews.  A 
company  official  goes  in  the  field  and  over  a, 
period  of  10  days  averages  18  interviews  per  day 
with  a  standard  deviation  of  five  per  day.   Using,' 
the  one  per  cent  level  of  significance,  should 
he  conclude  that  the  field  force  hurries  its 
interviews? 
Answer:  No 

LK   Golfer  A  knows  he  averages  &Q  strokes  per  round.  ' 
He  observes  that  Golfer  B  averages  86  strokes  for 
10  rounds  with  a  standard  deviation  of  6  per 
round.  How  many  strokes  handicap  per  round  can , » 
Golfer  A  give  Golfer  B  and  feel  95  per  cent  cer- 
tain of  beating  him  in  the  long'  run? 
Answer:  2.33  strokes 

5.  From  a  population  size  N  =  ^0,  a  sample  is  to  • 
be  drawn,  and  the  sample  average  is  to  be  used 
to  estimate  the  population  mean.   Which  provides 
a  more  accurate  estimate,  a'  sample  of  8  drawn 
without  replacement  or  a  sample  of  10  drawn  with 
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replacement  and  why? 

r"2 

Answer:  10  with  replacement  because v  /10  is  less 

than  4  0~  2/39 

•  * 

6.  A  certain  mental  test  yields  scores  in  months  of 
mental  age.  The  errors  of  measurement  in  this 
test  average  zero  in  the  long  run,  with  a  standard 
deviation  of  2  months.   If  a  class  of  J6   students 
takes  the  test,  what  is  the  probability  that  the  ■ 
average  score  for  the  class  is  in  error  by  one 
month  or  more? 

Answer:   .0026 

7.  In  crossing  two  pink  flowers  of  a  certain  variety, 
the  resulting  flowers  are  either  white ,  red,  or 
pink  with  respective  probabilities  6f  -| ,  Jy  and  -?. 
If  300  flowers  are  obtained  by  crossing  pink  flowers 
of  this  variety,  what  is  the  probability  that  90 

or  more  of  these  flowers  are  white? 
Answer:   .0228 
3.  A  drive  aims  for  a  goal  of  one  million  dollars  from 
a  college's  20,000  alumni.   A  'sample  of  26  responses 
yields  an  'average  contribution  of  $4fc  with  a  standard  , 
deviation  of  $20.   Based  on  the  sample  information, 
is  there  at  least  a  %   probability  of  reaching  the  • 
goal? 
Answer:  No.   The  maxicium  average  received  Is  $46.83 
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and  the  average  required  is  $50*00. 
9.  For  a  certain  occupation,  past  experience  in- 
dicates a  standard  deviation  of  wage  of  $.50 
per  hour.   How  large  must  a  sample  of  workers 
who  average  $2.50  be  in  order  that  the  95$ 
confidence  interval  has  a  range  of  $.25? 
Answer:   6l 
10.  A  research  report  stated  that  the  mean  income 
per  person  in  the  population  was  #^,500  + 
#800,  with  a  confidence  level -of  95$.  A  student 
interpreted  this  to  mean  that. 95$  °f  the  persons 
in  the  population  had  incomes  between  $3,700  and 
&5»300.  Another  student  interpreted  this  to 
mean  that  if  many  samples  were  taken,  95$  of 
them  would  have  sample  means  between  $3,700  and 
$5,300.   What  basic  mistakes  did  both  students 
make  ? 
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CHAPTER  1? 
Statistical  Decision  Rules  with  Normal  Sampling 
17.1  General  The  problems  discussed  In  Chapter  Seventeen 
of  the  basic  text  are  similar  to  those  of  Chapter  Ten  ex- 
cept that  they  are  based  on  the  Normal  distribution  instead 
of  the  binomial.   The  concept  of  the  Null  and  Alternate 

4 

Hypotheses,  Type  I  and  Type  II  Errors  and  Producers  and 
Consumer^  Risks  are  equally  applicable  here, 
17 • 2  Formulating  a  Decision  Rule 

The  decision  rule  is  determined  in  the  same  manner  as 
previously,  i.e.,  take  a  sample  of  size  n  (which  must  be 
sufficiently  large  for  a  Normal  distribution)  and  compute 
the  mean  x  of  this  sample.   If  x  is  greater  than  c  accept 
the  process.   The  key  to  this  procedure  is  the  selection 
of  the  decision  rule  or  (n,c)  pair.   Just  as  we  observed! 
in  Chapter  Ten,  the  value  of  c  can  be  changed  while  keeping 
n  fixed  and  thereby  decrease  the  probability  of  a  Type  I 
error,  but  only  at  the  expense  of  increasing  the  probability 
of  a  Type  II  error  or  vice  versa  depending  on  the  statement 
of  Hypotheses.  The  probability  of  both  type  errors  can  be 
reduced  by  increasing  n  and  changing  c  appropriately  but 
this  incurs  increased  sampling  cost.  However,  as  a  practical 
matter  it  is  so  difficult  to  formulate  a  decision  rule  by 
comparing  the  conditional  probabilities  of  wrong  decision 
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for  all  values  of  ym    under  all  possible  (n,c)  combinations 
that  the  decision  rule  is  usually  made  by  accepting  certain 
specified  levels  at  which  one  is  willing  to  be  in  error. 
This  procedure  is  outlined  on  page  276  ot   the  text  and  is 
almost  identical  to  that  contained  on  page  158  where  the 
procedure  for  deriving  a  decision  rule  under  the  Binomial 
Distribution  was  developed. 

17.3  The  Error  Characteristic  The  error  characteristic 
of  a  decision  rule  is  the  conditional  probability,  given 
any  xc   ,  that  the  decision  rule  will  lead  to  a  wrong  de- 
cision, that  is,  that  a  Type  I  or  II  error  will  be  made. 
Figures  17.2,  17. 3,  17. ^  and  17.5  of  the  text  illustrate 
the  error  characteristics  of  several  decision  rules. 
1? .4  '  Setting  the  Conditional  Probabilities  of  Error   In 
setting  the  conditional  probability  of  error  that  the  de- 
cision maker  is  willing  to  accept,  the  greater  his  require- 
ment for  accuracy,  and  the  less  will  be  his  acceptance  of 
error.   Thus  a  test  that  is  significant  at  the  .01  level 
has  a  greater  probability  of  reflecting  the  true  parameter 
of  the  population  than  a  test  at  the  .05  level.   In  any 
particular  decision  rule  of  this  type  the  sum  of  the  confidence 
level  and  the  significance  level  is  one.   Sometimes  It  is 
desireable  that  the  o<  and  ft   risks  be  balanced  in  which 
case  they  are  set  at  equal  levels.   On  the  other  hand  in  some 
processes  it  may  be  considered  that  changing  the  status  quo 
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for  the  worse  is  a  more  serious  error  than  failing  to 
change  it  for  the  better;  in  other  words,  altering  a  good 
process  may  be  more  serious  than  failing  to  alter  a  bad 
one.   For  this  reason,  there  are  many  processes  in  which  oC 
and  /3  are  deliberately  not  balanced,  and  the  oc    risk  is   , 
deliberately  set  at  a  lower  level  than  the  (3    risk.   Such 

is  the  case  in  an  illustrative  problem  that  follows. 

1 
17.5  Controlling  alpha  and  beta  on  a  two-tailed  test 

A  manufacturer  is  boxing  corn  flakes  in  family  size  • 
packages.   The  contents  should  weigh  16  ounces.   Manage- 
ment wishes  to  set  up  a  scheme  to  control  the  process;  they 
want  neither  too  much  nor  too  little  cereal  in  the  box. 
If  too  much  is  put  in  it  reduces  profits  and  if  too  little 
is  put  in  they  cheat  the  customer.   What  can  the  company 
statistician  do  to  help  set  up  the  controls? 

1)  He  can  recommend  a  type  of  sampling  design. 

2)  He  can  determine  the  proper  sample' size. 

3)  He  can  set  up  a  decision  rule.  ',  - 
Assuming  that  the  mean  ^U-  of  the  process  is  16  ounces 

and  the  standard  deviation  of  the  contents  of  the  package 
is  .k   ounce,  management  wishes  to  limit  the  probability  of 
making  a  Type  I  error  (alpha  risk)  to  two  per  cent.  Manage- 
ment also  desires  to  limit  the  (Bt   risk  to  5  per  cent  of 


1 
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concluding  that  the  boxes  contain'  16  ounces  on  the  average 
when  the  true  mean  is  15«75  ounces  and  the  ($&     risk  to 
5%   of  concluding  that  the  boxes  contain  an  average  of  16 
ounces  when  the  true  mean  is  16.25  ounces.   What  size 
sample  and  what  xc  s  should  the  manufacturer  employ  to 
fulfill  the  given . criteria.   Figure  17-1  is  a  diagram  which 
illustrates  the  problem. 

Figure  17-1 


15.75  Xol   16.0     xc2   16.25 
17«6  Computation  of  Sample  size  and  xci  and  xC2» 


xcl  -  16 


=  -2.33 


o<  =  .01 


.VVn 


=  1.6^5 


^=.05 


(l) 


(2) 


Solving  (1)  and  (2)  simultaneously  for  xcl  and  n 
we  have , 
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from  ( 1 ) , 

-.*[•  x  2.33 


xcl  -  16  = 


Vn  = 


>.4  x  2.33 


x.i  -16 


•cl 


from  (2),  substituting  for  Vn9 

*cl  "  15'75  =  -.1KX2.33   ("Cl  "  16) 

l.?07xcl  =  27.05 
xcl  =  15.85  , 


then 

yT=  -.fr  x  2.33 
15.85  -16 

ir-   "-932 
Vn  =  

-.15 
Vn  =  6.22 

n  =  38.7 
and  since  n  must  be  an  integer, 

n  =  39. 

To  compute  xc2  , 

xro   -  16  .  .    ■' 

— ^£ =  2.33 

.V6.22 

xc2  =  16.15.  ' 
Consequently  the  sampling  plan  is  to  select  a  sample 
of  size  39,  if  the  mean  contents  of  these  39  boxes,  x  is 
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between  15*85  ounces  and  16.15  ounces,  allow  the  process 
to  continue.   If  the  x-.  -  15.85  ounces,  or  x  ^  16.15  ounces, 
then  stop  the  machine  (accept  the  alternative)  and  determine 
the  cause.  Figure  17-2  is  the  power  curve  and  Figure  17-3' 
is  the  operating  characteristic  curve,  both  derived  from 
data  in  Table  17-1. 

,  • 

Table   17-1 
^U  P(Accepting  Hpl^  =y*  )    =/3  P(Rejectlng  K0\ji    ^/t)    =  o< 


15.00 

.0000 

15.85 

.5000 

16.00 

.9301 

16.15 

.5000 

17.00 

.0000 

1.0000 

.5000 

.0199 
.5000 

1.0000 


17.7   Controlling  Alpha  and  Beta  on  the  one -tailed  test 

Suppose  a  manufacturer  of  three-inch  nylon  line  for 

1 
the  U.  S.  Navy  has  discovered  a  new  manufacturing  process 

and  claims  that  his  new  process  makes  better  nylon  line 

with  a  mean  breaking  strength  of  28,000  lbs.  normally 

distributed  with  a  standard  deviation  of  2,^00  il/ba.   The 

standard  stock  three-inch  nylon  line  has  a  mean  breaking 
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strength  of  20,340  lbs.  with  a  standard  deviation  of 
2,400  lbs.   The  Bureau  of  Ships  decides  to  test  the  new 
line  and  sets  the  o<  and  /3  risks  at  the  .025  level.   What 
should  be  the  size  of  the  sample  n  and  the  xc  in  order  to 
meet  the  prescribed  criteria?  Develop  the  operating 
characteristic  and  the  power  curve  for  the  one-tailed 
test.  Figure  17-4  portrays  the  problem. 
(Note:   The  breaking  strength  has  been  converted  to  Tons 
for  ease  of  calculation. ) 

Figure  17-4 


17.8     Computation  of  Sample   size  n  and  x, 


x     -  10.17 

■  1.96 


1.2/Vn 


c^    =  .025 


(1) 


=  -  1.96 


ft    =  .025 


(2) 


1.2/tfn 

Solve   (1)  and   (2)   simultaneously  for  x0  and  n,   from  (1) 

/n  ■  1*96  x  1.2 
(x0   -  10.17) 
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and  substituting  for  tfn   in  (2),  we  have 
x0  -  14  »  -  (xc  -  10.1?) 
2.0xc  =  24.17  ' 
x0  -  12.1 
then  substituting  x0  in  (1),  we  have 

12.1  ■  10,  l7  =  x.96 
1/Kn 


Vn  - 


1.96 
1.6 


n  =  1.5 
but  n  must  be  an  integer,  so 

n  =  2. 

17.9  Developing  the  operating  Characteristic  and  Power  Curves. 

xc  ■  12.1 
(f~     =  1.2/  n  =  .85 

The  data  in  Table  17-2  have  been  computed  using  x0   =  12.1 
tons  and  £Tx  -.85.   Calculations  for  Table  17-2  and  pictorial 
diagrams  are  shown  below:  ™ 

Ea  -0 


u  = 

12.1   - 
.85 

12.1   - 
.85 

-   12.1   - 
.85 

14 

13  _ 

-  3.M 

-  2.24 

-  1.06 

(a 

=  .0003 

^ 

\ 

[/*< 

U   = 

a   .0126 

y\ 

\ 

±S         V 

U    = 

=  .1446 

/ 

\ 

c\ 

"0 

% 
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„    12.1  -  12      „ 
u — 118 


u  = 


u  = 


12.1  -  11 
.85 


12.1  -  10 


1.295 


u  = 


.85 


12.1  -  9 


=  2.^7 


ft   =  .■■■...5470 


f3  =  .9023 


ft  =   .9968 


1 

»' 

sx/y, 

<            '  * 

. 

u  = 


.85 


12.1  -  8 


«.  3. 


65  II  f3 


=  .9999 


.85 


4.92 


ft   =  1.0000 


Figure  17-5  is  plotted  from  the  above  data.   From  the 
operating  characteristic  curve,  we  see  that  the  probability 
of  keeping  the  old  nylon  line  when  the  new  breaking  strength 
is  less  than  10.5  tons  is  practically  1.0, except  for  the 
level  of  significance  of  .025  when  the  new  nylon  gives  a  ' 
mean  of  10. 17 tons.  It  then  falls  off  steeply  to  zero  so, 
that  there  is  practically  no  chance  of  continuing  to  stock 
the  old  line  when  the  mean  breaking  strength  exceeds  3> 
tons,  or  28,000  lbs.   The  power  curve  is  capable  of  the 
same  interpretation  as  for  the  operating  characteristic 
curve,  as  the  two  curves  are  essentially  equivalent  for 
this  purpose. 
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Figure  17-5 
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CHAPTER  18 

Expected  Loss  with  Normal  Sampling 

13.1  General  The  material  presented  in  Chapter  Eighteen 

of  the  basic  text  is  essentially  the  same  as  that  presented 
in  Chapters  Ten  and  Eleven  except  that  we  are  now  using  the 
Normal  Distribution  instead  of  the  Binomial. 

18.2  Three  courses  of  action  In  making  decisions  such  as 
whether  a  batch  of  material  is  good  and  should  be  used, 
whether  a  market  exists  for  a  new   product,  or  whether  a 
promotion  campaign  has  been  successful,  etc.,  the  decision 
maker  can  take  one  of  three  courses  of  action: 

(1)  He  can  accept  without  sampling  the  hypothesis  of  an 
affirmative  answer  to  the  above  questions;  that  is,  that 
the  true  //-   of  the  unknown  process  in  which  he  is  interest- 
ed is  beyond  the  breakeven  point.   However,  by  this  course 
he  incurs  a  risk  that  M     is  in  fact  less  than  the  breakeven 
point  and  consequently  a  loss  will  be  sustained  dependent 
upon  how  much  less  than  the  breakeven  point  jU-   actually  is. 

(2)  He  can  reject  without  sampling  the  hypothesis  of  an 
affirmative  answer  to  the  above  questions;  that  is,  that 
the  true Jj-   of  the  unknown  process  in  which  he  is  interested 
is  below  the  breakeven  point.   However,  by  this  course  he 
incurs  a  risk  that  1J~    is  in  fact  greater  than  the  breakeven 
point  and  a  loss  of  profit  will  be  sustained  dependent  upon 
how  much  greater  than  the  breakeven  point  LL    actually  is. 
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(3)  He  can  take  a  sample  of  the  unknown  process  and 
thereby  attempt  to  estimate  where  the  true  l/-,   actually  , 
Is — above  or  below  the  breakeven  point;  and  by  so  doing 
decide  whether  or  not  to  proceed  further  with  his  pro-  • 
duction,  advertising  or  promotion  campaign,  etc.   However, 
by  taking  a  sample  he  not  only  incurs  a  risk  that  the 
sample  is  incorrect  but  he  also  incurs  a  cost  of  sampling 
which  was  not  incurred  by  either  (l)  or  (2)  above;  and 
the  larger  the  sample,  the  more  expensive  its  cost.  . 
18.3  The  cost  of  each  course  of  action-Unconditional 
Expected  Total  Loss,   In  deciding  which  course  to  take, 
the  decision  maker  should  consider  the  costs  associated 
with  each.   These  can  be  computed  as  f ollows : 

1)  Accept  without  sampling.  If  the  true  U,    is  above 
the  breakeven  point,  no  loss  will  be  sustained,  but  any 
Lt-r   below  the  breakeven  point  will  result  in  a  loss  whose  ■ 
magnitude  is  conditional  upon  the  difference  between  the 
true  AL  and  the  breakeven  point.   Since  the  true  value 
of  ll     is  not  known  and  could  be  any  one  of  a  number  of 
values,  probabilities  are  assigned  to  the  occurrence  of 
these  various  possible  values.   Then,  the  conditional 
loss  that  would  occur  for  each  of  these  values  of  JJ~    is 
multiplied  by  the  probability  of  its  occurrence  and  results 
in  an  expected  loss  for  each  respective  >/-  .   If  we  sum  ' 
these  losses  we  will  get  the  sum  of  the  losses  for  the, 
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various  probabilities  of  occurrence.   This  sum,  then, 
is  the  best  estimate  of  the  loss  that  would  be  incurred 
by  making  an  affirmative  decision  without.' '■  sampling. 

2)  Reject  without  sampling  The  reasoning  here  is  the 
same  as  above  except  that  a  negative  decision  is  made 
without  sampling  on  the  assumption  that  the  true  it    is  . 
below  the  breakeven  point;  and  therefore  if  this  decision 
is  in  error,  then  a  loss  of  profit  will  be  incurred  de-  ' 
pendent  upon  how  far  above  the  true  JU   is  from  the  break- 
even point. 

3)  If  a  decision  is  made  paced  on  a  sample,  then  the 
expected  loss  for  each  possible  value  of  IP    will  be  the 
probability  that  the  sample  is  incorrect  multiplied  by 
the  conditional  loss  for  that  value  of  U-    times  the 
probability,  ?(/'-),  of  that  /A  oc curing.   The  unconditional 
expected  total  loss  will  be  the  sum  of  the  expected  losses 
plus  the  cost  of  sampling. 
18.^  The  importance  of  Lt      Prom  the  above  it  can  be  seen 
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that  the  true  value  of  jj-    determines  the  risks  which  a 
decision  maker  incurs  irrespective  of  what  course  of  action 
he  chooses  to  take.   This  also  means  that  the  magnitude 
of  profit  or  loss  is  determined  in  part  by  the  true  value 

Of  JJL     . 

18.5  The  breakeven  point  In  the  above  discussion  consider- 
able mention  has  been  made  of  the  breakeven  value  in 
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decision  processes.   This  is  the  point  at  which  it  makes 
no  difference  whether  the  decision  maker  accepts  or  rejects 
the  proposition  before  him  or,  in  other  words',  the  value 
of  LL    for  which  the  two  possible  acts,  accept  or  reject, 
are  equally  costly. 

18.6  The  subjective  assignment  of  /A   .   Considerable 
mention  has  also  been  made  of  the  probability,  P(/0,  of 
various  values  of  Lt-   occurring.   The  decision  maker,  of 
course,  is  seeking  mathematically  objective  information 
on  which  to  base  his  decision.   Yet  the  values  of  T(M-) 
may  be  quite  subjective — they  may  be  based  on  prior 
beliefs  about  a  process  (or  the  distribution  of  U-  ) ;  or 
on  the  other  hand,  the?/  may  be  quite  accurate,  having  been 
based  on  a  collection  of  historical  inf ormation.   Although 
generally  subjective,  ohe  values  assigned  to  FLu)    should 
be  based  on  the  most  accurate  information  available,  and 
in  this  connection,  later  material  in  the  text  provides 
a  better  method  of  assigning  values  to  PC^). 
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